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^-»1 GGATCCCAGGGAACGTGACC ATG GTC GTA GGG ATG ACT TGA CAGTTTCAACGGGGTGCGACCACCGTTGCGC 72 

yrfft V V G M Tj * 7 

5tQ li? NO: 2 

73 T CAGAAGGC AT AC GT TGGT GGAACACGTC GGAAAGC T GGGAGGTGAATC TG ATG GCT GGC GAC CAA GAG CTG 14 4 

SEC? Xi> f\):H^( H A G D Q E L 7 



1 



145 GAA CTG CGG TTC GAC GTT CCT CTT TAC ACG CTT GCC GAG GCA TCG CGG TAC CTG GTG GTT 204 
8ELRFDVPLYTLAEASRYLVV 27 

205 CCC CGC GCC ACC CTG GCT ACG TGG GCT GAC GGC TAC GAG CGT CGG CCG GCC AAC GCA CCG 264 
28 P RAT LATWADGYE R RPANAP 47 

2 65 GCG GTC CAG GGG CAA CCG ATC GCC TTT GAC GCC TAT TCG GTC GCG CAG CTT TTT GGC GAC 324 
48AV QGQP IA FDAYSVAQLFGD 67 

325 GTC ACT GGT GCC CGC GTT GCG. GGC GTC CAG CCG CAG CGA CAC CAC ATA CGG CCG GTC . CGG 38 4 
68VTGARVA'gVQPQRH HIRPVR 87 

385 TTG CGG GGG CCG TTG GGT GGG GTT GGG TGC CTC CGT CAC CCC AGG CAG TTC GCT GGC TAT 4 44 
88 L RGP LGGVGC LRHP RQFAGY 107 

4 45 TTG TCG CAG TAG CGCGACGGCATTGTCG ATG TCT TGG TAG CTAGCATCCGGTCGGGGGGCCGCTACCAGCG 515 
108 LSQ)* MSW* 4 

516 CCAGCGCCGGGGCTCCCCGGTCCGGGTAGTGCGCGTCGAGTTGGTCGTGGACCAGCA.ATG ACT GCG ACC CG£ 58 7 
1 Xt> H0:$( M T A T R 5 

5 88 CGA CTT CGA AAC CGC CAC CGG TTA GAT TCC CCG ACT GCG TCA TCG CCA GGT AAA CCG CCG 64 7 

6R L R N R H R • L 1 0 S P T A S S P ■ G • K P' P 1 ■ 25 

64 8 GCA CTA ACG CCA GCA ACC AAC CCG TGA AGAC CAACCAACGGC AC CTGC GC AGGTTGCGGCTC AACC GC AT C 718 
26 A L T P A T N P ) * 34 

719 ATG AAC TGC TGG ATT -TCG GAC TCC CCG TAC TCT CGC GCA GTG CGT GCC CGC GAG CCT ACC 778 
1MNCWIS DSPYSR AVRARE PT 20 

St QXl>N5: £ 779 GAA GAT CGC GTG CAT GCG TTC GGC GTG GAC CGC ACA GCA CCT GGA GTT GGC GGC GCC GAG 838 
21 ED RVHAFGVDRTAPGVGG AE 4 0 

839 GGC CGA GAT GGC AGG ATG ACG GAT CGT CGG GGG CGG GAA CTC CCA GGC CGC CGG ACC GTC 898 
41GRDGRMTDRRGRELPGRRTV 60 

899 GCA AAC CCG TCG CAA ACC CGT CGC AAA CCG TAA GGAGTCATCC ATG AAG ACA GGC ACC GCG 959 
61ANPSQTRRKP)* 5e Q X/> Nt?.0 f^M K T G T A 6 
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SFQ id a>o;& 

Insert of the clone containing DP428 and contained in seql 
1/1 31/11 
->GAT CGC CTT TGA CGC CTA TTC GGT CGC GCA GCT TTT TGG CGA CGT CAC TGG TGC CCG CGT 
asp arg leu OPA (arg leu phe gly arg ala ala phe trp arg arg his trp cys pro arg 
61/21 StQ Ifc NO*^ 91/31 

TGC GGG CGT . CCA GCC GCA GCG ACA CCA CAT ACG GCC GGT CCG GTT GCG GGG GCC GTT GGG 
cys gly arg pro ala ala ala thr pro his thr ala gly pro val ala gly ala val gly 
121/41 151/51 

TGG GGT TGG GTG CCT CCG TCA CCC CAG GCA GTT CGC TGG CTA TTT GTC GCA GTA GCG CGA 
trp gly trp val pro pro ser pro gin ala val arg trp leu phe val ala val ala arg 
181/61 211/71 

CGG CAT TGT CGA TGT CTT GGT AGC TAG CAT CCG GTC GGG GGG CCG CTA CCA GCG CCA GCG 
arg his cys arg cys leu gly ser)AMB(his pro val gly gly pro leu pro ala pro ala 
241/81 lb NOMC-^ 271/91 

CCG GGG CTC CCC GGT CCG GGT AGT GCG CGT CGA GTT GGT CGT GGA CCA GCA ATG ACT GCG 
pro gly leu pro gly pro gly ser ala arg arg val gly arg gly pro ala met thr ala 
301/101 331/111 

ACC CGG CGA CTT CGA AAC CGC CAC CGG TTA GAT TCC CCG ACT GCG TCA TCG CCA GGT AAA 
thr arg arg leu arg asn arg his arg leu asp ser pro thr ala ser ser pro gly lys 
361/121 391/131 

CCG CCG GCA CTA ACG CCA GCA ACC AAC CCG TGA AGA CCA ACC AAC GGC ACC TGC GCA GGT 
pro pro ala leu thr pro ala thr asn pro) OPA (arg^pro thr asn gly thr cys ala gly 
421/141 451/lSi\s£€> \& Nte>* U 

TGC GGC TCA ACC GCA TCA TGA ACT GCT GGA TTT CGG ACT CCC CGT ACT CTC GCG CAG TGC . 
cys gly ser thr ala ser) OPA (thr ala gly phe arg thr pro arg thr leu ala gin cys 
481/161 * ^S^O Ifc N6:i2. 511/171 

GTG CCC GCG AGC CTA CCG AAG ATC GCG TGC ATG CGT TCG GCG TGG ACC GCA CAG CAC CTG 
val pro ala ser leu pro lys ile ala cys met arg ser ala trp thr ala gin his leu 
'541/181* " 571/191 
GAG TTG GCG GCG CCG AGG GCC GAG ATG GCA GGA TGA .CGG ATC GTC GGG GGC GGG AAC TCC 
glu leu ala ala pro arg ala glu met ala glWOPA(arg ile val gly gly gly asn ser 
601/201 631/211 ^s^o \X> Mo: 13 

CAG GCC GCC GGA CCG TCG CAA ACC CGT CGC AAA CCC GTC GCA AAC CGT AAG GAG TCA TCC 
gin ala ala gly pro ser gin thr arg arg lys pro val ala asn arg lys glu ser ser 
661/221 691/231 

ATG AAG ACA GGC ACC GCG ACG ACG CGG CGC AGG CTG TTG GCA GTA CTG ATC GCC CTC GCG 
met lys thr gly thr ala thr thr arg arg arg leu leu ala val leu ile ala leu ala 
721/241 751/251 

TTG CCG GGG GCC GCC GTT GCG CTG CTG GCC GAA CCA TCA GCG ACC GGC GCG TCG GAC CCG 
leu pro gly ala ala val ala leu leu ala glu pro ser ala thr gly ala ser asp pro 
781/261 811/271 

TGC GCG GCC AGC GAA GTG GCG AGG ACG GTC GGT TCG GTC GCC AAG TCG ATG GGC GAC TAC 
cys ala ala ser glu val ala arg thr val gly ser val ala lys ser met gly asp tyr 
841/281 871/291 

CTG GAT TCA CAC CCA GAG ACC AAC CAG GTG ATG ACC GCG GTC TTG CAG CAG CAG GTA GGG 
leu asp ser his pro glu thr asn gin val met thr ala val leu gin gin gin val gly 
901/301 931/311 

CCG GGG TCG GTC GCA TCG CTG AAG GCC CAT TTC GAG GCG AAT CCC AAG GTC GCA TCG GAT 
pro gly ser val ala ser leu lys ala his phe glu ala asn pro lys val ala ser asp) 
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FIGURE 1A' 



1 



3/185 



Insert of the clone containing DP428, other reading frame 
2/1 32/11 
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arg 
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arg 
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GGC ATT 


GTC 
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GTA 


GCT 


AGC 


gly ile 


val 


asp 


val 
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val 


ala 


ser 


242/81 
















CGG GGC 


TCC 


CCG 


GTC 


CGG 


GTA 


GTG 


CGC 


arg gly 


ser 


pro 


val 


arg 


val 


val 


arg 


302/101 
















CCC GGC 
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TTC 


GAA 


ACC 


GCC 


ACC 


GGT 


pro gly 


asp 


phe 


glu 


thr 


ala 


thr 


giy) 
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NO: i%1 
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422/141 
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482/161 
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542/181 
















AGT TGG 
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pro 


val 



92/31 

CGG CCG GTC CGG TTG CGG GGG CC 
arg pro val arg leu arg gly pj 
152/51 

TTC GCT GGC TAT TTG TCG CAG Ti 
phe ala gly tyr leu ser gln)(Al 
212/71 SeQXbNo: IL-^^ 

CGG TCG GGG GGC CGC TAC CAG Ci 
arg ser gly gly arg tyr gin a 
272/91 



gin gin) OPA^Leu arg 




332/111 S&QJAt/V. 
TAG ATT CCC CGA CTG CGT CAT CGC CAG GTA AAC 
ile pro arg leu arg his arg gin val asn 
392/131 

GAA GAC CAA CCA ACG GCA CCT GCG CAG GTT 
glu asp gin pro thr ala pro ala gin val 
452/151 

TTC GGA CTC CCC GTA CTC TCG CGC AGT GCG 
phe gly leu pro val leu ,ser arg.„.se.r -ala 
512/171 

TGC GTT CGG CGT GGA CCG CAC AGC ACC TGG 
cys val arg arg gly pro his ser thr trp 
572/191 

GAT GAC GGA TCG TCG GGG GCG GGA ACT CCC 
asp asp gly ser ser gly ala gly thr pro^ 
632/211 

AAC CCG TCG CAA ACC GTA AGG AGT CAT CCA 
asn pro ser gin thr val arg ser his pro 
D 662/221 " ----- 692/231 

^Sv^v' TGA AGA CAG GCA CCG CGA CGA CGC GGC GCA GGC TGT TGG CAG TAC TGA TCG CCC TCG CGT 
r i^A^arg gin ala pro arg arg arg gly ala gly cys trp gin tyr) OPAjtser pro ser arg 

722/241 752/251 <£&&>HO\'2.l 

TGC CGG GGG CCG CCG TTG CGC TGC TGG CCG AAC CAT CAG CGA CCG GCG CGT CGG ACC CGT 
cys arg gly pro pro leu arg cys trp pro asn his gin arg pro ala arg arg thr arg 
782/261 812/271 ^ 

GCG CGG CCA GCG AAG TGG CGA GGA CGG TCG GTT CGG TCG CCA AGT CGA TGG GCG ACT ACC ■ 
ala arg pro ala lys trp arg gly arg ser val arg ser pro ser arg trp ala thr thr i_j 
842/281 872/291 ^ 

TGG ATT CAC ACC CAG AGA CCA ACC AGG TGA TGA -CCG CGG TCT TGC AGC AGC AQG TAG GGC 
trp ile his thr gin arg pro thr arg) OPA OPA(pra arg ser cys ser ser arcfl AMB(gly 
902/301 932/3lf s ^Q£bi4D:aa 

CGG GGT CGG TCG CAT CGC TGA AGG CCC ATT TCG AGG CGA ATC CCA AGG TCG CAT CGG ATC 
arg gly arg ser his arcMOPA^arg pro ile ser arg arg ile pro arg ser his arg ile^ 



SEQ ID ^03'. /V"^ 



FIGURE IB' 
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SeqlC: Insert of the DP428 clone, other reading frame 
3/1 33/11 

TCG CCT TTG ACG CCT ATT CGG TCG CGC AGC TTT TTG GCG ACG TCA CTG GTG CCC GCG TTG 
(ser pro leu thr pro ile arg ser arg ser phe leu ala thr ser leu val pro ala leu 
63/21 93/31 

CGG GCG TCC AGC CGC AGC GAC ACC ACA TAC GGC CGG TCC GGT TGC GGG GGC CGT TGG GTG 
arg ala ser ser arg ser asp thr thr tyr gly arg ser gly cys gly gly arg trp val 
123/41 153/51 

GGG TTG GGT GCC TCC GTC ACC CCA GGC AGT TCG CTG GCT ATT TGT CGC AGT AGC GCG ACG 
gly leu gly ala ser val thr pro gly ser ser leu ala ile cys arg ser ser ala thr 
183/61 213/71 

GCA TTG TCG ATG TCT TGG TAG CTA GCA TCC GGT CGG GGG GCC GCT ACC AGC GCC AGC GCC 
ala leu ser met ser trp)AMB(leu ala ser gly arg gly ala ala thr ser ala ser ala 
243/81 ^^^^11273/91 

GGG GCT CCC CGG TCC GGG TAG TGC GCG TCG AGT TGG TCG TGG ACC AGC AAT GAC TGC GAC 
gly ala pro arg ser glv^AMB (cys ala ser ser trp ser trp thr ser asn asp cys asp 
303/101 sebXbNO'.-XV-J* 333/111 

CCG GCG ACT TCG AAA CCG CCA CCG GTT AGA TTC CCC GAC TGC GTC ATC GCC AGG TAA ACC 
pro ala thr ser lys pro pro pro val arg phe pro asp cys val ile ala ar^> 
363/121 393/131 3^Xb>>J0\ 

GCC GGC ACT AAC GCC AGC AAC CAA CCC GTG AAG ACC AAC CAA CGG CAC CTG CGC AGG TTG 
ala gly thr asn ala ser asn gin pro val lys thr asn gin arg his leu arg arg leu 
423/141 453/151 

CGG CTC AAC CGC ATC ATG AAC TGC TGG ATT TCG GAC TCC CCG TAC TCT CGC GCA GTG CGT 
arg leu asn arg ile met asn. cys trp ile ser asp ser pro tyr ser arg ala val arg 
483/i6i J ' . ... 513 / 17 i 

GCC CGC GAG CCT ACC GAA GAT CGC GTG CAT GCG TTC GGC GTG GAC CGC ACA GCA CCT GGA 
ala arg glu pro thr glu asp arg val his ala phe gly val asp arg thr ala pro gly 
543/181 573/191 

GTT GGC GGC GCC GAG GGC CGA GAT GGC AGG ATG ACG GAT CGT CGG GGG CGG GAA CTC CCA 
val gly gly ala glu gly arg asp gly arg met thr asp arg arg gly arg glu leu pro 
603/201 633/211 

GGC CGC CGG ACC GTC GCA AAC CCG TCG CAA ACC CGT CGC AAA CCG TAA GGA GTC ATC CAT 
qly arq arg thr val ala asn pro ser gin thr arg arg lys pro) -OCHfgly val ile his 
663/221 693/231 7 ^ #K>%t> NO\ %T> 

GAA GAC AGG CAC CGC GAC GAC GCG GCG CAG GCT GTT GGC AGT ACT GAT CGC CCT CGC GTT 
glu asp arg his arg asp asp ala ala gin ala val gly ser thr asp arg pro arg val 
723/241 753/251 

GCC GGG GGC CGC CGT TGC GCT GCT GGC CGA ACC ATC AGC GAC CGG CGC GTC GGA CCC GTG 
ala gly gly arg arg cys ala ala gly arg thr ile ser asp arg arg val gly pro val 
783/261 813/271 

CGC GGC CAG CGA AGT GGC GAG GAC GGT CGG TTC GGT CGC CAA GTC GAT GGG CGA CTA CCT 
arg gly gin arg ser gly glu asp gly arg phe gly arg gin val asp gly arg leu pro 
843/281 873/291 

GGA TTC ACA CCC AGA GAC CAA CCA GGT GAT GAC CGC GGT CTT GCA GCA GCA GGT AGG GCC 
gly phe thr pro arg asp gin pro gly asp asp arg gly leu ala ala ala gly arg ala 
903/301 933/311 

GGG GTC GGT CGC ATC GCT GAA GGC CCA TTT CGA GGC GAA TCC CAA GGT CGC ATC GGA TC 
gly val gly arg ile ala glu gly pro phe arg gly glu ser gin gly arg ile glyj 



SEQ id j^>$: 95- 3>C? 
FIGURE lC f 
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Coding sequence DP428 identical to the Rv0203 predicted by Cole et al . 
(Nature 393:537-544) 

1/1 31/11 
1ATG AAG ACA GGC ACC GCG ACG ACG CGG CGC AGG CTG TTG GCA GTA CTG ATC GCC CTC GCG 

Leu ala val leu ile ala leu ala 



_^VMet lys thr gly thr ala thr thr arg arg arg leu le 
A 61/21 91/31 



TTG CCG GGG GCC GCC GTT GCG CTG CTG GCC GAA CCA TCA GCG ACC GGC GCG TCG GAC CCG 
leu pro gly ala ala val ala leu leu ala glu pro ser ala thr gly ala ser asp pro 
121/41 151/51 

TGC GCG GCC AGC GAA GTG GCG AGG ACG GTC GGT . TCG GTC GCC AAG TCG ATG GGC GAC TAC 
cys ala ala ser glu val ala arg thr val gly ser val ala lys ser met gly asp tyr 
181/61 211/71 

CTG GAT TCA CAC CCA GAG ACC AAC CAG GTG ATG ACC GCG GTC TTG CAG CAG CAG GTA GGG 
leu asp ser his pro glu thr asn gin val met thr ala val leu gin gin gin val gly 
241/81 271/91 

CCG GGG TCG GTC GCA TCG CTG AAG GCC CAT TTC GAG GCG AAT CCC AAG GTC GCA TCG GAT 
pro gly ser val ala ser leu lys ala his phe glu ala asn pro lys val ala ser asp 
301/101 331/111 

CTG CAC GCG CTT TCG CAA CCG CTG ACC GAT CTT TCG ACT CGG TGC TCG CTG CCG ATC AGC 
leu his ala leu ser gin pro leu thr asp leu ser thr arg cys ser leu pro ile ser 
361/121 391/131 

GGC CTG CAG GCG ATC GGT TTG ATG CAG GCG GTG CAG GGC GCC CGC CGG TAG 
gly leu gin ala ile gly leu met gin ala val gin gly ala arg arg^AMB 

SEQ id f4D$* 31- 32j 
, FIGURE ID' , 



ORF containing the DP428 sequence and forming part of seqlA 1 
1/1 31/11 
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CAG GGC 


GCC 


CGC 


CGG 


TAG 




























gin gly 


ala 


arg 


arg 


\AMB 





























seq id N/05 133- 3^ 



FIGURE IE' 



4> 



.V? . 6/185 



^ — >51 91 CCGGTCGGGGGGCCGCTACCAGCGCCAGCGCCGGGGCTCCCCGGTCCGGGTA GTG CGC GTC GAG TTG GTC GTG 563 
l c££fJX»AD*. ^(v R V E L V V 7 

564 GAC CAG CAA TGA CTGCGACCCGGCGACTTCGAAACCGCCACCGGTTAGATTCCCCGACTGCGTCATCGCCAGGTAA 63 9 



/ ^jj jDI L3rt^_ ^no unn i 
8 D Q qN ) * 



64 0 ACCGCCGGCACT AACGCCAGCAACCAACCC GTG AAG ACC AAC CAA CGG CAC CTG CGC AGG TTG CGG 7 05 

i sgnjrrbvKX - ^ _ ^(y kt nqrhlrrlr 12 

7 06 CTC AAC CGC ATC ATG AAC TGC TGG ATT TCG GAC TCC CCG TAC TCT CGC GCA GTG CGT GCC 7 65 

13LNRIMNCWISDSPYSRAVRA 32 



7 66 CGC GAG CCT ACC GAA GAT CGC GTG CAT GCG 
33 R E PT E DRV HA 

82 6 GGC GGC GCC GAG GGC CGA GAT GGC AGG ATG 
53 G G A E G R D G R M 

8 86 CGC CGG ACC GTC GCA AAC CCG TCG CAA ACC 
73RRTVANPSQT 

94 7 ACA GGC ACC GCG ACG ACG CGG CGC AGG CTG 
3TGTATTRRRL 

1007 GGG GCC GCC GTT GCG CTG CTG GCC GAA CCA 
23GAAVALLAEP 

1067 GCC AGC GAA GTG GCG AGG ACG GTC GGT TCG 
43 AS EVARTVGS 

1127 TCA CAC CCA GAG ACC AAC CAG GTG ATG ACC 
63 S H P E T N Q V M T 

1187 TCG GTC GCA TCG CTG AAG GCC CAT TTC GAG 
83 SVASLKAHFE 

1247 GCG CTT TCG CAA CCG CTG ACC GAT CTT TCG 
103 ALSQPLTDLS 



TTC GGC GTG GAC CGC ACA GCA CCT GGA GTT 82 5 

FGVDRTA PGV 52 

ACG GAT CGT CGG GGG CGG GAA CTC CCA GGC 885 

TDRRGR ELPG 72 

CGT CGC AAA CCG TAA GGAGTCATCC ATG AAG 94 6 

R R K p) * xxxxxx ^(h K 2 

TTG GCA GTA CTG ATC GCC CTC GCG TTG CCG 1006 

LAVLIAL ALP 22 

TCA GCG ACC GGC GCG TCG GAC CCG TGC GCG 1066- 

SATGASDPCA 42 

GTC GCC AAG TCG ATG GGC GAC TAC CTG GAT 1126 

VAKSMGDYLD 62 

GCG GTC TTG CAG CAG CAG GTA GGG CCG GGG 1186 

AVLQQ QVGPG 82 

GCG AAT CCC AAG GTC GCA TCG GAT CTG CAC 124 6 

ANPKVASDLH 102 

ACT CGG TGC TCG CTG CCG ATC AGC GGC CTG 1306 

TRCSLPISGL 122 



1307 CAG GCG ATC GGT TTG ATG CAG GCG GTG CAG 
123 QAIGLMQAVQ 

1367 CGG GTC CGG CGC AGT CGA CGT GAG GCA GCG 
6RVRRSRREAA 

14 27 TCT GGT CGC AGG TCA GGG GTC GGC GCT GGA 
26 SGRRSGVGAG 

14 87 CAG GGT GTG CCC TGC GGT TGG ATG ACA AGT 
46QGVPCGWMTS 

1547 TCG TTG T 
66 S lA 



GGC GCC CGC CGG TAG ATG CCG GAC CGC CGC 1 366 
G A R R ] * -^C M P D R R 5 

GTC GCC TAC CGG GGC GGT GTC TCG CCG CCT 14 2 6 
VAYRGGVSPP 25 

CCT TGC GGT GTG GTT TCG ACC GGG TCG TCG 14 8 6 
PCGVVSTGSS 45 

CGC AGG TTT GGA TCG GTT GGC GGG TCG CGA 154 6 
RRFGSVGGSR 65 

1553 
67 



seq id HDS: 

FIGURE 2 



7/185 



° 31/11 



# TCG CCG GCT CGC GGA CGT AGA TAA TAG CTC ACC GTT GGA CGA CCT CGA CAG GGT CCT TTG 
M (ser pro ala arg gly arg arg)oCH AMBflleu thr val gly arg pro arg gin gly pro leu) 

.oa 61/21 see*xbND-4V 9i/3i 

TGA CTG CCG GGC TTG ACG CGG ACG ACC ACA GAG TCG GGT CAT CGC CTA AGG CTA CCG TTC 

OPA^leu pro gly leu thr arg thr thr thr glu ser gly his arg leu arg leu pro phe) 
12 1/4 1 1 ^ (?6*Xl>ls/o: WM 151/51 

. TGA CCT GGG GTG CGT GGG CGC CGA CGA GTG AGG CAG TCA TGT CTC AGG GCC CAC CGC CAC 

OPA^ro gly val arg gly arg arg arg val arg gin ser cys leu arg ala his arg his 
181/61^ 211/71 

CTC GGT CGC CGG CAG TGT CAG CAT GTG CAG ATG ACT CCA CGC AGC TTG TTC GTG TTG' GTG 

leu qly arq arq qln cys gin his val gin met thr pro arg ser leu phe val leu val 
241/81 271/91 

TCG TGG TTG CGA CGA CTT GGC GCT GGT GAG CGC ACC CGC CGG CGT CGT GCC GCG CAT GCG 

ser trp leu arg arg leu gly ala gly glu arg thr arg arg arg arg ala ala his "ala 

301/101 

GAT C 

asp) 

SEQ ID NOS'. 41- ^5 
. • ■ , , .„ FIGURE 3A 



32/11 

^CGC CGG CTC GCG GAC GTA GAT AAT AGC TCA CCG TTG GAC GAC CTC GAC AGG GTC CTT TGT 

*\S{Vy/arg arg leu ala asp val asp asn ser. ser pro leu asp asp leu asp arg val leu cys 
^62/21 92/31 

GAC TGC CGG GCT TGA CGC GGA CGA CCA CAG AGT CGG GTC ATC GCC TAA GGC TAC CGT TCT 

asp cys arg ala^OPAfarg gly arg pro gin ser arg val ile alaj OCH (gly tyr arg ser 
122/41 U *-S#Mb*6\*<8 152/51 ^Q^C> HO\ ^ 

GAC CTG GGG TGC GTG GGC GCC GAC GAG TGA GGC AGT CAT GTC TCA GGG CCC ACC GCC ACC 

asp leu gly cys val gly ala asp glu) OPA (gly ser his val ser gly pro thr ala thr 
182/61 36&31>M0'.5D_1^^212/71 

TCG GTC GCC GGC AGT GTC AGC ATG TGC AGA TGA CTC CAC GCA GCT TGT TCG TGT TGG TGT 

ser val ala gly ser val ser met cys arg*)OPA (^eu his ala ala cys ser cys trp cys 
242/81 y 272/91 * j;t> ho\ S I 

CGT GGT TGC GAC GAC TTG GCG CTG GTG AGC GCA CCC GCC GGC GTC GTG CCG CGC ATG CGG 

arg gly cys asp asp leu ala leu val ser ala pro ala gly val val pro arg met arg 

302/101 

ATC 

ile) 



SEQ ID NDS'. M6?-5 ( 
FIGURE 3B 



8/185 

33/11 

^ ^GCC GGC TCG CGG ACG TAG ATA ATA GCT CAC CGT TGG ACG ACC TCG ACA GGG T.CC TTT GTG 

^ ut r^7>(ala gly ser arg thr\AMB(ile ile ala his arg trp thr thr ser thr gly ser phe val 
f?Q^ 63/21 / \**£QXbtiO:SH 93/31 

& ACT GCC GGG CTT GAC GGG GAG GAC CAC AGA GTC GGG TCA TCG CCT AAG GCT ACQ GTT CTG 

thr ala gly leu asp ala asp asp his arg val gly ser ser pro lys ala thr val leu 
123/41 153/51 

ACC TGG GGT GCG TGG GCG CCG ACG AGT GAG GCA GTC ATG TCT CAG GGC CCA CCG CCA CCT 
thr trp gly ala trp ala pro thr ser glu ala val met ser gin gly pro pro pro pro 
183/61 213/71 

CGG TCG CCG GCA GTG TCA GCA TGT GCA GAT GAC TCC ACG CAG CTT GTT CGT GTT GGT GTC 
arg ser pro ala val ser ala cys ala asp asp ser thr gin leu val arg val gly val 
243/81 273/91 

GTG GTT GCG ACG ACT TGG CGC TGG TGA GCG CAC CCG CCG GCG TCG TGC CGC GCA TGC GGA 
val val ala thr thr trp arg trploPA^ala his pro pro ala ser cys arg ala cys gly) 

^seqzcb AID'.SS 

TC 



SEQ ID hi03\ 50.-55 
FIGURE 3C 



$6R 31/11 

^ CCA ATT TTC CTT CGC GCC GTG CAA TAC CAT CTG CAA GAC CAG CGA CGG CCC GTG GTT GCG 
>fpro ile phe leu arg ala val gin tyr his leu gin asp gin arg arg pro val val ala 
^61/21 91/31 
~<&^* GTC GCG CAG CTT GCG GAA ACC GGG TAT GGA CCC TGC CGT ACC GTT GTT GCC ACT TGA TGT 

val ala gin leu ala glu thr gly tyr gly pro cys arg thr) val val ala thr OPA^cys 

121/41 151/51 5«>XbfVCr,5* — ^ 

CGT CGC TCT CCA CCC GTC GGG GGG CGA AAG CCA TTC CGA CAC TGG GAT CCT CAA AAC GTC 
arg arg ser pro pro val gly gly arg lys pro phe arg his trp asp pro gin asn val 
181/61 211/71 

GGC TGA GTG TCT GCA GGG CTC CGG GGA GCA GCC GAT CAT CAC CAT GTA CGA ACT GAA TAA 
glyloPAfval ser ala gly leu arg gly ala ala asp his his his val arg thr glu) OCH 
241/81 V ^^X(^ND^ 271/91 ' 

n GTC CCC CGC GCG CGA CTT CCA GAC ATT TGT TGT GGT TTC GGT TGA GGC CGA GGC GAG GCT 
^5-^0|(val pro arg ala arg leu pro asp ile cys cys gly phe glw OPAigly arg gly glu ala 
301/101 331/111 ^Z&ZX&NO: M 

CAT TTC GCA GCA ACC GGT CTC CGG GTC GCA GCA TCG TTG CGG CGA TCG CGG CGC AGT CGT 
his phe ala ala thr gly leu arg val ala ala ser leu arg arg ser arg arg ser arg 
361/121 

CGG ACG AGT CGT CGT CAA CGA CCA CGA TC 
arg thr ser arg arg gin arg pro arg) 



SEQ ID }4DS'-SG>~&>\ 



FIGURE 4A 



9/185 



^ 32/11 

^CAA TTT TCC TTC GCG CCG TGC AAT ACC ATC TGC AAG ACC AGC GAC GGC CCG TGG TTG CGG 
glnfphe ser phe ala pro cys asn thr ile cys lys thr ser asp gly pro trp leu arg 
62/21 FL $E& :c C> HU\&& 92/31 

TCG CGC AGC TTG CGG AAA CCG GGT ATG GAC CCT GCC GTA CCG TTG TTG CCA CTT GAT GTC 
ser arg ser leu arg lys pro gly met asp pro ala val pro leu leu pro leu asp val 
122/41 152/51 

GTC GCT CTC CAC CCG TCG GGG GGC GAA AGC CAT TCC GAC ACT GGG ATC CTC AAA ACG TCG 
' val ala leu his pro ser gly gly glu ser his ser asp thr gly ile leu lys thr ser 
182/61 212/71 

GCT GAG TGT CTG CAG GGC TCC GGG GAG CAG CCG ATC ATC ACC ATG TAC GAA CTG AAT AAG 
ala glu cys leu gin gly ser gly glu gin pro ile ile thr met tyr glu leu asn lys 
242/81 , 272/91 

TCC CCC GCG CGC GAC TTC CAG ACA TTT GTT GTG GTT TCG GTT GAG GCC GAG GCG AGG CTC 
ser pro ala arg asp phe gin thr phe val val val ser val glu ala glu ala arg .leu 
302/101 332/111 

ATT TCG CAG CAA CCG GTC TCC GGG TCG CAG CAT CGT TGC GGC GAT CGC GGC GCA GTC GTC 
ile ser gin gin pro val ser gly ser gin his arg cys gly asp arg gly ala val val 
362/121 

GGA CGA GTC GTC GTC AAC GAC CAC GAT C 
gly arg val val val asn asp his asp) 

seq id nds; G?a-&3> 

FIGURE 4B 

33/11 



AAT TTT 


CCT 


TCG 


CGC 


CGT 


GCA 


ATA 


CCA 


TCT 


GCA AGA 


CCA 


GCG 


ACG 


GCC 


CGT 


GGT 


TGC 


GGT 


asn phe 


pro 


ser 


arg 


arg 


ala 


ile 


pro 


ser 


ala arg 


pro 


ala 


thr 


ala 


arg 


gly 


cys 


gly 


*63/21 . 
















93/31 


















CGC GCA 


GCT 


TGC 


GGA 


AAC 


CGG 


GTA 


TGG 


ACC 


CTG CCG 


TAC 


CGT 


TGT 


TGC 


CAC 


TTG 


ATG 


TCG 


arg ala 


ala 


cys 


gly 


asn 


arg 


val 


trp 


thr 


leu pro 


tyr 


arg 


cys 


cys 


his 


leu 


met 


ser 


123/41 


















153/51 


















TCG CTC 


TCC 


ACC 


CGT 


CGG 


GGG 


GCG 


AAA 


GCC 


ATT CCG 


ACA 


CTG 


GGA 


TCC 


TCA 


AAA 


CGT 


CGG 


ser leu 


ser 


thr 


arg 


arg 


gly 


ala 


lys 


ala 


ile pro 


thr 


leu 


gly 


ser 


ser 


lys 


arg 


arg 


183/61 


















213/71 


















CTG AGT 


GTC 


TGC 


AGG 


GCT 


CCG 


GGG 


AGC 


AGC 


CGA TCA 


TCA 


CCA 


TGT 


ACG 


AAC 


TGA 


ATA 


AGT 


leu ser 
243/81 


val 


cys 


arg 


ala 


pro 


gly 


ser 


ser 


arg ser 
273/91 


ser 


pro 


cys 


thr 


asm 


1 OPA! 


[ile ser 


CCC CCG 


CGC 


GCG 


ACT 


TCC 


AGA 


CAT 


TTG 


TTG 


TGG TTT 


CGG 


TTG 


AGG 


CCG 


AGG 


CGA 


GGC 


TCA 


pro pro 


arg 


ala 


thr 


ser 


arg 


his 


leu 


leu 


trp phe 


arg 


leu 


arg 


pro 


arg 


arg 


gly 


ser 


303/101 
















333/111 
















TCG 


TTT CGC 


AGC 


AAC 


CGG 


TCT 


CCG 


GGT 


CGC 


AGC 


ATC GTT 


GCG 


GCG 


ATC 


GCG 


GCG 


CAG 


TCG 


phe arg 


ser 


asn 


arg 


ser 


pro 


gly 


arg 


ser 


ile val 


ala 


ala 


ile 


ala 


ala 


gin 


ser 


ser 


363/121 




































GAC GAG 


TCG 


TCG 


TCA 


ACG 


ACC 


ACG 


ATC 






















asp glu 


ser 


ser 


ser 


thr 


thr 


thr 


ile} 





















SEQ ID MOS' <bM-fc»fc 



FIGURE 4C 
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part of the nucleotide sequence of seq4A 
( 1/1 31/11 

VCCG CGC GCG ACT TCC AGA CAT TTG TTG TGG TTT CGG TTG AGG CCG AGG CGA GGC.TCA TTT 



♦U^fpro arg ala thr ser arg his leu leu trp phe arg leu arg pro arg arg gly ser phe 
WV * 6 i/2l 91/31 

CGC AGC AAG CGG TCT CCG GGT CGC AGC ATC GTT GCG GCG ATC GCG GCG CAG TCG TCG GAC 
^ arg ser lys arg ser pro gly arg ser ile val ala ala ile ala ala gin ser ser asp 

121/41 

GAG TCG TCG TCA ACG ACC ACG ATC 
glu ser ser ser thr thr thr ilej 

SEQ ID flsjC£? 6?7-6^ 
FIGURE 4A f 

,1/1 31/11 

V^CGC GCG CGA CTT CCA GAC ATT TGT TGT GGT TTC GGT TGA GGC CGA GGC GAG GCT CAT TTC 
»farq ala arg leu pro asp ile cys cys gly phe gly)OPA(gly arg gly Glu ala" his' phe 
61/21 91/31 ' K *3eSSbNXXll 

j& GCA GCA AGC GGT CTC CGG GTC GCA GCA TCG TTG CGG CGA TCG CGG CGC AGT CGT CGG ACG 

>§P ala ala ser gly leu arg val ala ala ser leu arg arg ser arg arg ser arg arg thr 

& 121/41 

AGT CGT CGT CAA CGA CCA CGA TC 
ser arg arg gin arg pro arg^ 

SEQ ID M05 f . 67-7/ 
FIGURE 4B f 

I 1/1 31/11 

V^GCC GCG CGC GAC TTC CAG ACA TTT GTT GTG GTT TCG GTT GAG GCC GAG GCG AGG CTC ATT 
^(ala ala arg asp phe gin thr phe val val val ser val glu ala glu ala arg leu ile 
-^.^ 61/21 91/31 

TCG CAG CAA GCG GTC TCC GGG TCG CAG CAT CGT TGC GGC GAT CGC GGC GCA GTC GTC GGA 
47 ser gin gin ala val ser gly ser gin his arg cys gly asp arg gly ala val val gly 

121/41 

CGA GTC GTC GTC AAC GAC CAC GAT C 
arg val val val asn asp his asp 



SEQ ID MT>5\T 3^-T2> 
FIGURE 4C 
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ORF according to Cole et al . (Nature 393:537-544) and containing the sequence 
Seq 4A' 



1/1 


















31/11 


















^jjtga ata 


agt 


ccg 


ccg 


cgc 


gcg 


act 


tec 


aga 


cat ttg 


ttg 


tgg 


ttt 


egg 


ttg 


agg 


ccg 


agg 


OPAfile 


ser 


pro 


pro 


arg 


ala 


thr 


ser 


arg 


his leu 


leu 


trp 


phe 


arg 


leu 


arg 


pro 


arg 


61/21 ^-5>eUrb Mb\15 












91/31 


















cga ggc 


tea 


ttt 


cgc 


age 


aag 


egg 


tct 


ccg 


ggt cgc 


age 


ate 


gtt 


gcg 


gcg 


ate 


gcg 


gcg 


arg gly 


ser 


phe 


arg 


ser 


lya 


arg 


ser 


pro 


gly arg 


ser 


ile 


val 


ala 


ala 


ile 


ala 


ala 


121/41 


















151/51 


















cag teg 


teg 


gac 


gag 


teg 


teg 


tea 


acg 


acc 


acg ate 


teg 


aac 


teg 


acg 


ccc 


tec 


tgt 


teg 


gin ser 


ser 


asp 


glu 


ser 


ser 


ser 


thr 


thr 


thr ile 


ser 


asn 


ser 


thr 


pro 


ser 


cys 


ser 


. 181/61 


















211/71 












atg 






agg atg 


eta 


cgc 


aga 


cag 


cgc 


teg 


atg 


gtg 


gcg ccg 


ttg 


ttg 


tac 


ate 


ggg 


cac 


acc 


arg met 


leu 


arg 


arg 


gin 


arg 


ser 


met 


val 


ala pro 


leu 


leu 


tyr 


ile 


gly 


met 


his 


thr 


241/81 


















271/91 


















gag ata 


age 


ggt 


ttc 


gee 


ggg 


ttc 


ace 


gat 


acc acg 


ctt 


gat 


gca 


tea 


cca 


ggc 


acc 


aca 


glu ile 


ser 


gly 


phe 


ala 


gly 


phe 


thr 


asp 


thr thr 


leu 


asp 


ala 


ser 


pro 


gly 


thr 


thr 



301/101 

tgg cga etc aga gac tag 
trp arg leu arg asp)AMB 



seq id 7M-75 

FIGURE 4t>' 



«Q sequence upstream of seq4A* and fused with seq4A' 
f l/l 31/11 

V GCA ACC TAC CAG CAG AGC CAG GGG CTC ACA GGA CCT AAA GGA GTA GCG CCC ATG GCT GAT 
.^(ala thr tyr gin gin ser gin gly leu thr gly pro lys gly val ala pro met ala asp 

.<p C 

eP" SEQ ID tfDS: 76-77 



. V? FIGURE 4E' 

seq4J' in another reading frame 
1/1 31/11 
^JKCG CAA CCT ACC AGC AGA GCC AGG GGC TCA CAG GAC CTA AAG GAG TAG CGC CCA TGG CTG 
^ifthr gin pro thr ser arg ala arg gly ser gln)asp(leu lys glu AMB arg pro trp leu 

/y 61/21 ^Gnt>WD:<*p 

ATC 



0 



1J 



SEQ ID U0S.l%-^O 



^ FIGURE 4V 



I seq 4 J 1 in the third reading frame 
/ 1/1 31/11 

VCGC AAC CTA CCA GCA GAG CCA GGG GCT CAC AGG ACC TAA AGG AGT AGC GCC CAT GGC TGA 
_/arg asn leu pro ala glu pro gly ala his arg thr)OCH (arg ser ser ala his glylOPA 

TC 

SEQ ID f#)S*. ^f-ra 
FIGURE 4$' 



12/185 



sequence Rv2050 predicted by Cole 

seq4 J 

1/1 

,^U.>JATG GCT GAT CGT GTC CTG AGG GGC AGT 
€l6t^ ^ (Met ala asp arg val leu arg gly ser 

■ 61/21 

„ AAC CAC GAC CTG GCG CCG CGC CAG ATC 
asn his asp leu ala pro arg gin ile 
121/41 

GAA GTC CCG TTC GCC GAT GAC GCC GAG 
glu val pro phe ala asp asp ala glu 
181/61 

GAA GGC ACC CTG ATC GAG GGC GAC CTG 
glu gly thr leu ile glu gly asp leu 
241/81 

CAC TGG GAC ATG CTG CTG GAG CGC CGT 
his trp asp met leu leu glu arg arg 
301/101 

CGC CTC GAG CTC ATT CGG TCA CGT CGG 
arg leu glu leu ile arg ser arg arg 



et al. (Nature 393:537-544) and containing 
31/11 

CGC CTC GGA GCC GTG AGC TAT GAG ACC GAC CGC 
arg leu gly ala val ser tyr glu thr asp arg 
91/31 

GCG CGG TAC CGC ACC GAC AAC GGC GAG GAG TTC 
ala arg tyr arg thr asp asn gly glu glu phe 
151/51 

ATC CCC GGC ACC TGG TTG TGC CGC AAC GGC ATG 
ile pro gly thr trp leu cys arg asn gly met 
211/71 

CCC GAG CCG AAG AAG GTT AAG CCG CCC CGG ACG 
pro glu pro lys lys val lys pro pro arg thr 
271/91 

TCC ATC GAA GAA CTC GAA GAG TTA CTT AAG GAG 
ser ile glu glu leu glu glu leu leu lys glu 

331/111 
CGC GGC TGA 
arg gly)OPA 



seq id HO* : Vi-%S 

FIGURE 4\V 

ORF according to Cole et al . (Nature 393:537-544) and containing the sequence 
Rv2050 

1/1 31/11 
«<g fc — j^TAG TCC GCC CGG GTG TCC GAT CCC GGT ATC ATT GAT GGT CGC GCC GCG CGC GTC GCG TGC 

*JZ&^ * AMB 4 er ala ar 9 val ser as P P ro 9 lv ile ile as P 9 lv ar 9 ala ala ar 9 val ala cvs 

61/21 *W^IC> Wt>\ %1 91/31 

CGG GAA CTA CGC AGA CGG CCG CAG CGT TTG CCA ACC GGA GCC AGT CGC CAG TAC GCA ACC 
arg glu leu arg arg arg pro gin arg leu pro thr gly ala ser arg gin tyr ala thr 
121/41 151/51 

TAC CAG CAG AGC CCA GGG CTC ACA GGA CCT AAA GGA GTA GCG CCC ATG GCT GAT CGT GTC 
tyr gin gin ser pro gly leu thr gly pro lys gly val ala pro met ala asp arg val 
181/61 211/71 

CTG AGG GGC AGT CGC CTC GGA GCC GTG AGC TAT GAG ACC GAC CGC AAC CAC GAC CTG GCG 
leu arg gly ser arg leu gly ala val ser tyr glu thr asp arg asn his asp leu ala 
241/81 271/91 

CCG CGC CAG ATC GCG CGG TAC CGC ACC GAC AAC GGC GAG GAG TTC GAA GTC CCG TTC GCC 
pro arg gin ile ala arg tyr arg thr asp asn gly glu glu phe glu val pro phe ala 
301/101 331/111 

GAT GAC GCC GAG ATC CCC GGC ACC TGG TTG TGC CGC AAC GGC ATG GAA GGC ACC CTG ATC 
asp asp ala glu ile pro gly thr trp leu cys arg asn gly met glu gly thr leu ile 
361/121 391/131 

GAG GGC GAC CTG CCC GAG CCG AAG AAG GTT AAG CCG CCC CGG ACG CAC TGG GAC ATG CTG 
glu gly asp leu pro glu pro lys lys val lys pro pro arg thr his trp asp met leu 
421/141 451/151 

CTG GAG CGC CGT TCC ATC GAA GAA CTC GAA GAG TTA CTT AAG GAG CGC CTC GAG CTC ATT 
leu glu arg arg ser ile glu glu leu glu glu leu leu lys glu arg leu glu leu ile , 
481/161 

CGG TCA CGT CGG CGC GGC TGA 
arg ser arg arg arg gly^OPA 

seq id fitc&:%6?-Tn 



FIGURE 4X' 



, . 13/185 





















31/11 


















GAT CGC 


GGT 


CAA 


CGA 


GGC 


CGA 


ATA 


CGG 


CGA 


GAT GTG 


GGC 


CCA 


AGA 


CGC 


CGC 


CGC 


GAT 


GTT 


asp arg 


gly 


gin 


arg 




arg 


ile 


arg 


arg 


asp val 


gly 


pro 


arg 


arg 


arg 


arg 


asp 


val 


61/21 












91/31 


















TGG CTA 


(CGC 


CGC 


GGC 


GAC 


GGC 


GAC 


GGC 


GAC 


GGC GAC 


GTT 


GCT 


GCC 


GTT 


CGA 


GGA 


GGC 


GCC 


trp leu 


arg 


arg 


gly 


asp 


gly 


asp 


gly 


asp 


gly asp 


val 


ala 


ala 


val 


arg 


gly 


gly 


ala 


121/41 


















151/51 


















GGA GAT 


GAC 


CAG 


CGC 


GGG 


TGG 


GCT 


CCT 


CGA 


GCA GGC 


CGC 


CGC 


GGT 


CGA 


GGA 


GGC 


CTC 


CGA 


gly asp 


asp 


gin 


arg 


gly 


trp 


ala 


pro 


arg 


ala gly 


arg 


arg 


gly 


arg 


gly 


gly 


leu 


arg 


181/61 


















211/71 


















CAC CGC 


CGC 


GGC 


GAA 


CCA 


GTT 


GAT 


GAA 


CAA 


TGT GCC 


CCA 


GGC 


GCT 


GCA 


ACA 


GCT 


GGC 


CCA 


his arg 


arg 


gly 


glu 


pro 


val 


asp 


glu 


gin 


cys ala 


pro 


gly 


ala 


ala 


thr 


ala 


gly 


pro 


241/81 


















271/91 


















GCC CAC 


GCA 


GGG 


CAC 


CAC 


GCC 


TTC 


TTC 


CAA 


GCT GGG 


TGG 


CCT 


GTG 


GAA 


GAC 


GGT 


CTC 


GCC 


ala his 


ala 


gly 


his 


his 


ala 


phe 


phe 


gin 


ala gly 


trp 


pro 


val 


glu 


asp 


gly 


leu 


ala 



301/101 

GCA TCG GTC GCC GAT C 
ala ser val ala asp^ 



SEQ ID NC&'.tt-V^ 
FIGURE 5A 



32/11 



(fATC GCG 


GTC 


AAC 


GAG 


GCC 


GAA 


TAC 


GGC 


GAG 


ATG TGG 


GCC 


CAA 


GAC 


GCC 


GCC 


GCG 


ATG 


TTT 


^VMile ala 

62/21 


val 


asn 


glu 


ala 


glu 


tyr 


gly 


glu 


met trp 


ala 


gin 


asp 


ala 


ala 


ala 


met 


phe 


















92/31 


















GGC TAC 


GCC 


GCG 


GCG 


ACG 


GCG 


ACG 


GCG 


ACG 


GCG ACG 


TTG 


CTG 


CCG 


TTC 


GAG 


GAG 


GCG 


CCG 


gly tyr 


ala 


ala 


ala 


thr 


ala 


thr 


ala 


thr 


ala thr 


leu 


leu 


pro 


phe 


glu 


glu 


ala 


pro 


122/41 


















152/51 


















GAG ATG 


ACC 


AGC 


GCG 


GGT 


GGG 


CTC 


CTC 


GAG 


CAG GCC 


GCC 


GCG 


GTC 


GAG 


GAG 


GCC 


TCC 


GAC 


glu met 


thr 


ser 


ala 


gly 


gly 


leu 


leu 


glu 


gin ala 


ala 


ala 


val 


glu 


glu 


ala 


ser 


asp 


182/61 


















212/71 


















ACC GCC 


GCG 


GCG 


AAC 


CAG 


TTG 


ATG 


AAC 


AAT 


GTG CCC 


CAG 


GCG 


CTG 


CAA 


CAG 


CTG 


GCC 


CAG 


thr ala 


ala 


ala 


asn 


gin 


leu 


met 


asn 


asn 


val pro 


gin 


ala 


leu 


gin 


gin 


leu 


ala 


gin 


242/81 


















272/91 


















CCC ACG 


CAG 


GGC 


ACC 


ACG 


CCT 


TCT 


TCC 


AAG 


CTG GGT 


GGC 


CTG 


TGG 


AAG 


ACG 


GTC 


TCG 


CCG 


pro thr 


gin 


gly 


thr 


thr 


pro 


ser 


ser 


lys 


leu gly 


gly 


leu 


trp 


lys 


thr 


val 


ser 


pro 



302/101 

CAT CGG TCG CCG ATC 
his arg ser pro ile 



SEQ ID fJD5'- ^O-^W 
FIGURE 5B 



14/185 

33/11 

(^.TCG CGG TCA ACG AGG CCG AAT ACG GCG AGA TGT GGG CCC AAG ACG CCG CCG CGA TGT TTG 
ser (arg ser thr arg pro asn thr ala arg cys gly pro lys thr pro pro arg cys leu 
63/21^^^>WO\ c \'B 93/31 

GCT ACG CCG CGG CGA CGG CGA CGG CGA CGG CGA CGT TGC TGC CGT TCG AGG AGG CGC CGG 
ala thr pro arg arg arg arg arg arg arg arg arg cys cys arg ser arg arg arg arg 
123/41 153/51 

AGA TGA CCA GCG CGG GTG GGC TCC TCG AGC AGG CCG CCG CGG TCG AGG AGG CCT CCG ACA 
arg^OPAmro ala arg val gly ser ser ser arg pro pro arg ser arg arg pro pro thr 
183/61 * ^6^^t>Ai0^4 213/71 

CCG CCG CGG CGA ACC AGT TGA TGA ACA ATG TGC CCC AGG CGC TGC AAC AGC TGG CCC AGC 
pro pro arg arg thr ser^OPA OP_A(thr met cys pro arg arg cys asn ser trp pro ser 
243/81 S^ifeflP'.^S*" 273/91 

CCA CGC AGG GCA CCA CGC CTT CTT CCA AGC TGG GTG GCC TGT GGA AGA CGG TCT CGC CGC 
pro arg arg ala pro arg leu leu pro ser trp val ala cys gly arg arg ser arg arg 
303/101 

ATC GGT CGC CGA TC 
ile gly arg arg^ 



SEQ ID PvlDS'.^a.-S'D 
FIGURE 5C 



part of the nucleotide sequence Seq SA 
^BCt Tt> NO". 9 (p 





















31/11 


















V)CGC CGC 


GGC 


GAC 


GGC 


GAC 


GGC 


GAC 


GGC 


GAC 


GTT GCT 


GCC 


GTT 


CGA 


GGA 


GGC 


GCC 


GGA 


GAT 


61/21 
GAC CAG 


gly 


asp 


gly 


asp 


gly 


asp 


gly 


asp 


val ala 
91/31 


ala 


val 


arg 


gly 


gly 


ala 


gly 


asp 


CGC 


GGG 


TGG 


GCT 


CCT 


CGA 


GCA 


GGC 


CGC CGC 


GGT 


CGA 


GGA 


GGC 


CTC 


CGA 


CAC 


CGC 


asp gin 


arg 


gly 


trp 


ala 


pro 


arg 


ala 


gly 


arg arg 


gly 


arg 


gly 


gly 


leu 


arg 


his 


arg 


121/41 


















151/51 


















CGC GGC 


GAA 


CCA 


GTT 


GAT 


GAA 


CAA 


TGT 


GCC 


CCA GGC 


GCT 


GCA 


ACA 


GCT 


GGC 


CCA 


GCC 


CAC 


arg gly 


glu 


pro 


val 


asp 


glu 


gin 


cys 


ala 


pro gly 


ala 


ala 


thr 


ala 


gly 


pro 


ala 


his 


181/61 


















271/71 


















GCA GGG 


CAC 


CAC 


GCC 


TTC 


TTC 


CAA 


GCT 


GGG 


TGG CCT 


GTG 


GAA 


GAC 


GGT 


CTC 


GCC 


GCA 


TCG 


ala gly 


his 


his 


ala 


phe 


phe 


gin 


ala 


gly 


trp pro 


val 


glu 


asp 


gly 


leu 


ala 


ala 


ser 


241/81 




































GTC GCC 


GAT 


C 
































val ala 


asp 



































SEQ ID : 



FIGURE 5A' 



15/185 

31/11 





f 1/1 








VTAC GCC 


GCG 


GCG 


ala 


ala 


ala 




61/21 








ATG ACC 


AGC 


GCG 




met thr 


ser 


ala 




121/41 








GCC GCG 


GCG 


AAC 




ala ala 


ala 


asn 




181/61 








ACG CAG 


GGC 


ACC 




thr gin 


gly 


thr 




241/81 








CGG TCG 


CCG 


ATC 




arg ser 


pro 


ile 



ACG 


GCG 


ACG 


GCG 


ACG 


GCG 


ACG TTG 


CTG 


CCG 


T.TC 


GAG 


GAG 


GCG 


CCG 


GAG 


thr 


ala 


thr 


ala 


thr 


ala 


thr leu 


leu 


pro 


phe 


glu 


glu 


ala 


pro 


glu 














91/31 


















GGT 


GGG 


CTC 


CTC 


GAG 


CAG 


GCC GCC 


GCG 


GTC 


GAG 


GAG 


GCC 


TCC 


GAC 


ACC 


gly 


gly 


leu 


leu 


glu 


gin 


ala ala 


ala 


val 


glu 


glu 


ala 


ser 


asp 


thr 














151/51 


















CAG 


TTG 


ATG 


AAC 


AAT 


GTG 


CCC CAG 


GCG 


CTG 


CAA 


CAG 


CTG 


GCC 


CAG 


CCC 


gin 


leu 


met 


asn 


asn 


val 


pro gin 


ala 


leu 


gin 


gin 


leu 


ala 


gin 


pro 














211/71 


















ACG 


CCT 


TCT 


TCC 


AAG 


CTG 


GGT GGC 


CTG 


TGG 


AAG 


ACG 


GTC 


TCG 


CCG 


CAT 


thr 


pro 


ser 


ser 


lys 


leu 


gly gly 


leu 


trp 


lys 


thr 


val 


ser 


pro 


his 



seq id mce:^-^ 

FIGURE 5B' 




1/1 31/11 
T ACG CCG CGG CGA CGG CGA CGG CGA CGG CGA CGT- TGC TGC CGT TCG AGG AGG CGC CGG AGA 
i^^/thr pro arg arg arg arg arg arg arg arg arg cys cys arg ser arg arg arg arg arg) 
^ 61/21 91/31 

TGA CCA GCG CGG GTG GGC TCC TCG AGC AGG CCG CCG CGG TCG AGG AGG CCT CCG ACA CCG 
OPA(pro ala arg val gly ser ser ser arg pro pro arg ser arg arg pro pro thr pro 
121/41 f ^> ffQ:tCi LWL 151/51 

CCG CGG CGA ACC AGT TGA TGA ACA ATG TGC CCC AGG CGC TGC AAC AGC TGG CCC AGC CCA 
pro arg arg thr ser) OPA OPA fthr met cys pro arg arg cys. asn ser trp pro ser pro 
181/61 5eG>X^^*.lDV>^ 211/71 

CGC AGG GCA CCA CGC CTT CTT CCA AGC TGG GTG GCC TGT GGA AGA CGG TCT CGC CGC ATC 
arg arg ala pro arg leu leu pro ser trp val ala cys gly arg arg ser arg arg ile 
241/81 

GGT CGC CGA TC 
gly arg arg^ 



seq id IDD-{D$ 

FIGURE 5C T 



16/185 



ORF predicted by Cole et al. (Nature 393:537-544) and containing seqSA* 
1/1 31/11 
<K*\D"-loK>»tga ac t gat gat tct gat age gac caa cct ctt ggg gca aaa cac ccc ggc gat cgc ggt 
OPA(thr asp asp ser asp ser asp gin pro leu gly ala lys his pro gly asp arg gly 
61/21 ^$eG>:Tt>NO:iOS 91/31 

caa cga ggc cga ata egg cga gat gtg ggc cca aga cgc cgc cgc gat gtt tgg eta cgc 
gin arg gly arg ile arg arg asp val gly pro arg arg arg arg asp val trp leu arg 
121/41 151/51 

cgc ggc gac ggc gac ggc gac ggc gac gtt get gcc gtt cga gga ggc gcc gga gat gac 
arg gly asp gly asp gly asp gly asp val ala ala val arg gly gly ala gly asp asp 
' 181/61 211/71 

cag cgc ggg tgg get cct cga gca ggc cgc cgc ggt cga gga ggc etc cga cac cgc cgc 
gin arg gly trp ala pro arg ala gly arg arg gly arg gly gly leu arg his arg arg 
241/81 271/91. 

ggc gaa cca gtt gat gaa caa tgt gcc cca ggc get gca aca get ggc cca gcc cac gca 
gly glu pro val asp glu gin cys ala pro gly ala ala thr ala gly pro ala his ala 
301/101 331/111 

ggg cac cac gcc ttc ttc caa get ggg tgg cct gtg gaa gac ggt etc gcc gca teg gtc 
gly his his ala phe phe gin ala gly trp pro val glu asp gly leu ala ala ser val 
361/121 391/131 

gcc gat cag caa cat ggt gtc gat ggc caa caa cca cat gtc gat gac caa etc ggg tgt 
ala asp gin gin his gly val asp gly gin glnpro his val asp asp gin leu gly cys 
421/141 451/151 

gtc gat gac caa cac ctt gag etc gat gtt gaa ggg ctt tgc tec ggc ggc ggc cgc cca 
val asp asp gin his leu glu leu asp val glu gly leu cys ser gly gly gly arg pro 

- 481/161- . 511/.171 

ggc cgt gca aac cgc ggc gca aaa egg ggt ccg ggc gat gag etc get ggg cag etc get 
gly arg ala asn arg gly ala lys arg gly pro gly asp glu leu ala gly gin leu ala 
541/181 571/191 

ggg ttc ttc ggg tct ggg egg tgg ggt ggc cgc caa ctt ggg teg ggc ggc etc ggt egg 
gly phe phe gly ser gly arg trp gly gly arg gin leu gly ser gly gly leu gly arg 
601/201 631/211 

ttc gtt gtc ggt gcc gca ggc ctg ggc cgc ggc caa cca ggc agt cac ccc ggc ggc gcg 
phe val val gly ala ala gly leu gly arg gly gin pro gly ser his pro gly gly ala 
661/221 691/231 

ggc get gcc get gac cag cct gac cag cgc cgc gga aag agg gcc egg gca gat get ggg 
gly ala ala ala asp gin pro asp gin arg arg gly lys arg ala arg ala asp ala gly 
721/241 751/251 

egg get gcc ggt ggg gca gat ggg cgc cag ggc egg tgg tgg get cag tgg tgt get gcg 
arg ala ala gly gly ala asp gly arg gin gly arg trp trp ala gin trp cys ala ala 
781/261 811/271 

tgt tec gcc gcg acc eta tgt gat gcc gca ttc tec ggc ggc egg eta gga gag ggg gcg 
cys ser ala ala thr leu cys asp ala ala phe ser gly gly arg leu gly glu gly ala 
841/281 

cag act gtc gtt att tga 
gin thr val val ileJOPA 

SEQ ID ^051 WHj 105 
FIGURE S£>' 



f 17/185 

sequence Rvll96 predicted by Cole et al. (Nature 393:537-544) and capable of encoding an 
4&> ORF fused with SeqSA* 

% 1/1 31/11 
K^atg gtg gat ttc ggg gcg tta cca ccg gag ate aac tec gcg agg atg tac gee ggc ccg 
^s^Met val asp phe gly ala leu pro pro glu ile asn ser ala arg met tyr ala gly pro 
^ 61/21 91/31 

ggt teg gec teg ctg gtg gec gcg get cag atg tgg gac age gtg gcg agt gac ctg ttt 
gly ser ala ser leu val ala ala ala gin met trp asp ser val ala ser asp leu phe 
*r 121/41 151/51 

teg gec gcg teg gcg ttt cag teg gtg gtc tgg ggt ctg acg gtg ggg teg tgg ata ggt 
ser ala ala ser ala phe gin ser val val trp gly leu thr val gly ser trp ile gly 
181/61 211/71 

teg teg gcg ggt ctg atg gtg gcg gcg gec teg ccg tat gtg gcg tgg atg age gtc acc 
ser ser ala gly leu met val ala 

ala ala ser pro tyr val ala trp met ser val thr 
241/81 271/91 

gcg ggg cag gee gag ctg acc gee gec cag gtc egg gtt get gcg gcg gee tac gag acg 
ala gly gin ala glu leu thr ala ala gin val arg val ala ala ala ala tyr glu thr 
301/101 331/111 

gcg tat ggg ctg acg gtg ccc ccg ccg gtg ate gee gag aac cgt get gaa ctg atg att 
ala tyr gly leu thr val pro pro pro val ile ala glu asn arg ala glu leu met ile 
361/121 . 391/131 

ctg ata gcg acc aac etc ttg ggg caa aac acc ccg gcg ate gcg gtc aac gag gee gaa 
leu ile ala thr asn leu leu gly gin asn thr pro ala ile ala val asn glu ala glu 
421/141 451/151 

tac ggc gag atg tgg gee caa gac gee gee gcg atg ttt ggc tac gec gcg gcg acg gcg 
tyr gly glu met trp ala gin asp ala ala ala met phe gly tyr ala ala ala thr ala 
481/161 511/171 

acg gcg acg gcg acg ttg ctg ccg ttc gag gag gcg ccg gag atg acc age gcg ggt ggg 

thr ala thr ala thr leu leu pro' phe 1 glu glu ala pro glu met thr ser ala gly gly 

541/181 ' 571/191 

etc etc gag cag gee gec gcg gtc gag gag gee tec gac acc gec gcg gcg aac cag ttg 
leu leu glu gin ala ala ala val glu glu ala ser asp thr ala ala ala asn gin leu 
601/201 631/211 

atg aac aat gtg ccc cag gcg ctg caa cag ctg gee cag ccc acg cag ggc acc acg cct 
met asn asn val pro gin ala leu gin gin leu ala gin pro thr gin gly thr thr pro 
661/221 691/231 

tct tec aag ctg ggt ggc ctg tgg aag acg gtc teg ccg cat egg teg ccg ate age aac 
ser ser lys leu gly gly leu trp lys thr val ser pro his arg ser pro ile ser asn 
721/241 751/251 

atg gtg teg atg gee aac aac cac atg teg atg acc aac teg ggt gtg teg atg acc aac 
met val ser met ala asn asn his met ser met thr asn ser gly val ser met thr asn 
781/261 811/271 

acc ttg age teg atg ttg aag ggc ttt get ccg gcg gcg gec gec cag gec gtg caa acc 
thr leu ser ser met leu lys gly phe ala pro ala ala ala ala gin ala val gin thr 
841/281 871/291 

gcg gcg caa aac ggg gtc egg gcg atg age teg ctg ggc age teg ctg ggt tct teg ggt 
ala ala gin asn gly val arg ala met ser ser leu gly ser ser leu gly ser ser gly 
901/301 931/311 

ctg ggc ggt ggg gtg gee gec aac ttg ggt egg gcg gee teg gtc ggt teg ttg teg gtg 
leu gly gly gly val ala ala asn leu gly arg ala ala ser val gly ser leu ser val 
961/321 991/331 

ccg cag gee tgg gee gcg gee aac cag gca gtc acc ccg gcg gcg egg gcg ctg ccg ctg 
pro gin ala trp ala ala ala asn gin ala val thr pro ala ala arg ala leu pro leu 
1021/341 1051/351 

acc age ctg acc age gec gcg gaa aga ggg ccc ggg cag atg ctg ggc ggg ctg ccg gtg 
thr ser leu thr ser ala ala glu arg gly pro gly gin met leu gly gly leu pro val 
1081/361 1111/371 

ggg cag atg ggc gee agg gee ggt ggt ggg etc agt ggt gtg ctg cgt gtt ccg ccg cga 
gly gin met gly ala arg ala gly gly gly leu ser gly val leu arg val pro pro arg 
1141/381 1171/391 
ccc tat gtg atg ccg cat tct ccg gcg gec ggc tag 
pro tyr val met pro his ser pro ala ala gly\AMB 

SEQ ID ^5'. (Dfe, IDH 

i 

FIGURE 5E' 
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^ Seq 5P: ORF according to Cole et al . (Nature 393:537-544) and containing the sequence 
> Rvll96 



B 1/1 



31/11 



^^tag gga cac gta atg gtg gat ttc ggg gcg tta cca ccg gag ate aac tec gcg agg atg 
AMB Gly his val met val asp phe gly ala leu pro pro glu ile asn ser ala arg met 
61/21^S^Q , Xt>ND: IP*J 91/31 

tac gec ggc ccg ggt teg gec teg ctg gtg gee gcg get cag atg tgg gac age gtg gcg 
tyr ala gly pro gly ser ala ser leu val ala ala ala gin met trp asp ser val ala 
121/41 151/51 

agt gac ctg ttt teg gec gcg teg gcg ttt cag teg gtg gtc tgg ggt ctg acg gtg ggg 
ser asp leu phe ser ala ala ser ala phe gin ser val val trp gly leu thr val gly 
' 181/61 211/71 

teg tgg ata ggt teg teg gcg ggt ctg atg gtg gcg gcg gec teg ccg tat gtg gcg tgg 
ser trp ile gly ser ser ala gly leu met val ala ala ala ser pro tyr val ala trp 
241/81 271/91 

atg age gtc ace gcg ggg cag gee gag ctg ace gec gee cag gtc egg gtt get gcg gcg . 
met ser val thr ala gly gin ala glu leu thr ala ala gin val arg val ala ala ala 
301/101 331/111 

gec tac gag acg gcg tat ggg ctg acg gtg ccc ccg ccg gtg ate gec gag aac cgt get 
ala tyr glu thr ala tyr gly leu thr val pro pro pro val ile ala glu asn arg ala 
361/121 391/131 

gaa ctg atg att ctg ata gcg acc aac etc ttg ggg caa aac ace ccg gcg ate gcg gtc 
glu leu met ile leu ile ala thr asn leu leu gly gin asn thr pro ala ile ala val 
421/141 451/151 t 

aac gag gee gaa tac ggc gag atg tgg gee caa gac gec gee gcg atg ttt ggc tac gee 
asn glu ala glu tyr gly glu met trp ala gin asp ala ala ala met phe gly tyr ala 
48.1/161 511/171 

gcg gcg acg gcg acg gcg acg gcg acg ttg ctg ccg ttc gag gag gcg ccg gag atg acc 
' " ala ala thr ala thr ala thr ala thr leu leu pro phe glu glu ala pro glu met thr 
541/181 571/191 

age gcg ggt ggg etc etc gag cag gee gee gcg gtc gag gag gee tec gac acc gee gcg 
ser ala gly gly leu leu glu gin ala ala ala val glu glu ala ser asp thr ala ala 
601/201 631/211 

gcg aac cag ttg atg aac aat gtg ccc cag gcg ctg caa cag ctg gee cag ccc acg cag. 
ala asn gin leu met asn asn val pro gin ala leu gin gin leu ala gin pro thr gin 
661/221 691/231 

ggc acc acg cct tct tec aag ctg ggt ggc ctg tgg aag acg gtc teg ccg cat egg teg 
gly thr thr pro ser ser lys leu gly gly leu trp lys thr val ser pro his arg ser 
721/241 751/251 

ccg ate age aac atg gtg teg atg gec aac aac cac atg teg atg acc aac teg ggt gtg 
pro ile ser asn met val ser met ala asn asn his met ser met thr asn ser gly val 
781/261 811/271 

teg atg acc aac acc ttg age teg atg ttg aag ggc ttt get ccg gcg gcg gec gec cag 
ser met thr asn thr leu ser ser met leu lys gly phe ala pro ala ala ala ala gin 
841/281 871/291 

gee gtg caa acc gcg gcg caa aac ggg gtc egg gcg atg age teg ctg ggc age teg ctg 
ala val gin thr ala ala gin asn gly val arg ala met ser ser leu gly ser ser leu 
901/301 931/311 

ggt tct teg ggt ctg ggc ggt ggg gtg gee gee aac ttg ggt egg gcg gee teg gtc ggt 
gly ser ser gly leu gly gly gly val ala ala asn leu gly arg ala ala ser val gly 
961/321 991/331 

teg ttg teg gtg ccg cag gee tgg gee gcg gee aac cag gca gtc acc ccg gcg gcg egg 
ser leu ser val pro gin ala trp ala ala ala asn gin ala val thr pro ala ala arg 
1021/341 1051/351 

gcg ctg ccg ctg acc age ctg acc age gee gcg gaa aga ggg ccc ggg cag atg ctg ggc 
ala leu pro leu thr ser leu thr ser ala ala glu arg gly pro gly gin met leu gly 
1081/361 1111/371 

ggg ctg ccg gtg ggg cag atg ggc gee agg gee ggt ggt ggg etc agt ggt gtg ctg cgt 
gly leu pro val gly gin met gly ala arg ala gly gly gly leu ser gly val leu arg 
1141/381 1171/391 

gtt ccg ccg cga ccc tat gtg atg ccg cat tct ccg gcg gee ggc tag 



val pro pro arg pro tyr val met pro. his ser pro ala ala glyj 



AMB 



seq id mds: fDS, 



FIGURE 5F' 
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V s 31/11 

GGA TCC TGA TGC AAG TGG TCC GGG ATT TGT CGG CAG CCA CCG CGG TCC CGT CGA CCA ACG 
gly ser OPA (cys lys trp ser gly ile cys arg gin pro arg arg ser arg arg pro thr 
61/21 SPftXDWCWU 91/31 

TTG GTG CAT CCG GGC TGC GAG CAT GCA CGC ACC GAC CAG CGC GGC GAG CGC GGC TAG CTG 
leu val his pro gly cys glu his ala arg thr asp gin arg gly glu arg gly^AMB^Leu 
121/41 151/51 IU— * 

CTT GCC CAC TGT TCC TCC CTG CCG GCA CCA TGT GCG ACA AGC TTA AGC GCA GCA GTA CCG 
leu ala his cys ser ser leu pro ala pro cys ala thr ser leu ser ala ala val pro 
181/61 211/71 

GCG GTG CCT GGG CAT CCA GCA AAA CGG GGA GCT CAA GAA CGA TTC ATG AAC GAG GGG TCG 
ala val pro gly his pro ala lys arg gly ala gin glu arg phe met asn glu gly sec 
241/81 271/91 

TCA CCA ACG TCG AAA CCG ACG GTT GCC AGC CGG CCC ACG ATA TTG CGT GCT CGA GGG TCC 
ser pro thr ser lys pro thr val ala ser arg pro thr ile leu arg ala arg gly ser 
301/101 331/111 

GCT GTA CCC TCA CCG AAC GTG AGT CCC ACA CCG CGG AGG CGG GCG ACT CTG GCG TCG TTA 
ala val pro ser pro asn val ser pro thr pro arg arg arg ala thr leu ala ser leu 
361/121 391/131 

GCA GCC GAG CTC AAG GTG TCC CGC ACC ACT GTC TCG AAT GCT TTT AAC CGA CCG GAT CCA 
ala ala glu leu lys val ser arg thr thr val ser asn ala phe asn arg pro asp pro 
421/141 

GAA GGA GAA GAIL C 
glu gly glu aspj 

SEQ ID flD&MID- 1 1 0. 
FIGURE 6A 

^e® 32/n 

VGAT CCT GAT GCA AGT GGT CCG GGA TTT GTC GGC AGC CAC GGC GGT CCC GTC GAC CAA CGT 
vi4 V a sp pro asp ala ser gly pro gly phe val gly ser his gly gly pro val asp gin arg. 
rlrf^ 0 62/21 92/31 

TGG TGC ATC CGG GCT GCG AGC ATG CAC GCA CCG ACC AGC GCG GCG AGC GCG GCT AGC TGC 
trp cys ile arg ala ala ser met his ala pro thr ser ala ala ser ala ala ser cys 
122/41 152/51 

TTG CCC ACT GTT CCT CCC TGC CGG CAC CAT GTG CGA CAA GCT TM GCG CAG CAG TAC CGG 
leu pro thr val pro pro cys arg his his val arg gin alayOCH (ala gin gin tyr arg 

182/61 212/71 ' v ^se«»fc/r>:ji5 

CGG TGC CTG GGC ATC CAG CAA AAC GGG GAG CTC AAG AAC GAT TCA TGA ACG AGG GGT CGT 
arg cys leu gly ile gin gin asn gly glu leu lys asn asp ser)oPA(thr arg gly arg 
242/81 272/91 ^seQXbHO' U<> 

CAC CAA CGT CGA AAC CGA CGG TTG CCA GCC GGC CCA CGA TAT TGC GTG CTC GAG GGT CCG 
his gin arg arg asn arg arg leu pro ala gly pro arg tyr cys val leu glu gly pro 
302/101 332/111 

CTG TAC CCT CAC CGA ACG TGA GTC CCA CAC CGC GGA GGC GGG CGA CTC TGG CGT CGT TAG 
leu tyr pro his arg thr) OPA (val pro his arg gly gly gly arg leu trp arg argjAMB 
362/121 seQXbUDWW 3 * 392/131 

CAG CCG AGC TCA AGG TGT CCC GCA CCA CTG TCT CGA ATG CTT TTA ACC GAC CGG ATC CAG 
*v(^gln pro ser ser arg cys pro ala pro leu ser arg met leu leu thr asp arg ile gin 
, N sfc 422/141 

AAG GAG AAG ATC 
lys glu lys ile") 

seq id pV£>5; U3- 



FIGURE 6B 



20/185 



/DttA./*?*ATC CTG 


ATG 


CAA 


GTG 


GTC 


CGG 


GAT 


TTG 


(lie leu 


met 


gin 


val 


val 


arg 


asp 


leu 


















GGT GCA 


TCC 


GGG 


CTG 


CGA 


GCA 


TGC 


ACG 


gly ala 


ser 


gly 


leu 


arg 


ala 


cys 


thr 


123/41 
















TGC CCA 


CTG 


TTC 


CTC 


CCT 


GCC 


GGC 


ACC 


cys pro 


leu 


phe 


leu 


pro 


ala 


gly 


thr 


183/61 
















GGT GCC 


TGG 


GCA 


TCC 


AGC 


AAA 


ACG 


GGG 


gly ala 


trp 


ala 


ser 


ser 


lys 


thr 


gly 


243/81 
















ACC AAC 


GTC 


GAA 


ACC 


GAC 


GGT 


TGC 


CAG 


thr asn 


val 


glu 


thr 


asp 


gly 


cys 


gin 


303/101 
















TGT ACC 


CTC 


ACC 


GAA 


CGT 


GAG 


TCC 


CAC 


cys thr 


leu 


thr 


glu 


arg 


glu 


ser 


his 


363/121 
















AGC CGA 


GCT 


CAA 


GGT 


GTC 


CCG 


CAC 


CAC 


ser arg 


ala 


gin 


gly 


val 


pro 


his 


his 


423/141 
















AGG AGA 


AGA 


TC 













arg arg argj 



33/11 



TCG 


GCA GCC 


ACG 


GCG 


GTC 


CCG 


TCG 


ACC 


AAC 


GTT 


ser 


ala ala 


thr 


ala 


val 


pro 


ser 


thr 


asn 


val 




ft *3 / O 1 


















r* T\ 

CAC 


CGA CCA 


GCG 


CGG 


CGA 


GCG 


CGG 


CTA 


GCT 


GCT 


his 


arg pro 


ala 


arg 


arg 


ala 


arg 


leu 


ala 


ala 






















ATG 


TGC GAC 


AAG 


CTT 


AAG 


CGC 


AGC 


AGT 


ACC 


GGC 


met 


cys asp 


lys 


leu 


lys 


arg 


scr 


ser 


thr 


gly 




213/71 


















AGC 


TCA AGA 


ACG 


ATT 


CAT 


GAA 


CGA 


GGG 


GTC 


GTC 


ser 


ser arg 


thr 


ile 


his 


glu 


arg 


gly 


val 


val 




273/91 


















CCG 


GCC CAC 


GAT 


ATT 


GCG 


TGC 


TCG 


AGG 


GTC 


CGC 


pro 


ala his 


asp ile 


i ala cys ser arg val ar« 




333/111 


















ACC 


GCG GAG 


GCG 


GGC 


GAC 


TCT 


GGC 


GTC 


GTT 


AGC 


thr 


ala glu 


ala 


gly 


asp 


ser 


gly 


val 


val 


ser 




393/131 


















TGT 


CTC GAA 


TGC 


TTT 


TAA 


CCG 


ACC 


GGA 


TCC 


AGA 


cys 


leu 1 glu 


cys 


phe) 


OCH 


fpro_ 


thr 


gly 


•ser 


arg 








2£&> jD Ho 


- /2i' 









SEQ ID /IPS. / / 
FIGURE 6C 



5C(S) iD/la 122 31/n 

CCG TCG GCA ACT TGG CCG CTG AGG TCG GCT TGA TCC CTG GGC CGA GGC GGG TCA GCC AAT 
^(pro ser ala thr trp pro leu arg ser ala) CPA (ser leu gly arg gly gly ser ala asn 

lDrt/*3 61/21 91/31*S£G> /2> **./2</ 

Ul AGC GGC TCC ATC GGC TTT GCT GGT AGC GGT TCG GCG GGA AGC TAG CGG CGA CGT TGT CGG 

ser gly ser ile gly phe ala gly ser gly ser ala gly ser) AMB (arg arg arg cys arg 
121/41 151/51 S£<£> '"£5 

TGG CCG GTG ATA TAT TCG GTC AGA CGG GTA TGG CGG CGG CTG AGG TGA TCT GCG ACA CGC 

trp pro val ile tyr trp val arg arg val trp arg arg leu ar^ OPA (ser ala thr arg 
181/61 211/71 i& Ho. <2<"" 

CGC CGC GGT GCT CGA GCC AGG CTT ACG ACC AGG GAA TTT CGA AAA TGT TAT TCA GAA CAT 

arg arg gly ala arg ala arg leu thr thr arg glu phe arg lys cys tyr ser glu his 
241/81 271/91 

CTT GTA TCT CTT CTC CGT GCC ACC CCC TAG GTG TAG TGT TTT CGA GTA CCG GCA GAT CCC 

leu val ser leu leu arg ala thr pro) AMB val AM^(cys phe arg val pro ala asp pro 

301/101 SC& ID. fa. 

AGG TTC ACC AGG TCT CAC CAG ATC 

arg phe thr arg ser his gin ile) 



SEQ ID . W0&./22- '2-7 



FIGURE 7 A 



21/185 



*5&G> '^f^CGT CGG CAA CTT GGC CGC TGA GGT CGG CTT GAT CCC TGG GCC GAG GCG GGT CAG CCA ATA 
^(arg arg gin leu gly arg) OPA^ly arg leu asp pro trp ala glu ala gly gin pro ile 
5c® <D Ma/29 62/21 C3£& ft> Mo- igD 92/31 



GCG GCT 


CCA TCG 


GCT 


TTG 


CTG 


GTA 


GCG 


GTT 


CGG CGG 


GAA 


GCT AGC 


GGC 


GAC 


GTT 


GTC 


GGT 


ala ala 


pro ser 


ala 


leu 


leu 


val 


ala 


val 


arg arg 


glu 


ala ser 


gly 


asp 


val 


val 


gly 


122/41 
















152/51 
















GGC CGG 


TGA TAT 


ATT 


GGG 


TCA 


GAC 


GGG 


TAT 


GGC GGC 


GGC 


TGA GGT 


GAT 


CTG 


CGA 


CAC 


GCC 


gly arg) OPAjftyr 


ile 


gly 


ser 


asp 


gly 


tyr 


gly. gly 


gly) OPA/gly 


asp 


leu 


arg 


his 


ala 


182/61 












212/71 














GCC GCG 


GTG CTC 


GAG 


CCA 


GGC 


TTA 


CGA 


CCA 


GGG AAT 


TTC 


GAA AAT 


GTT 


ATT 


CAG 


AAC 


ATC 


ala ala 


val leu 


glu 


pro 


gly 


leu 


arg 


pro 


gly asn 


phe 


glu asn 


val 


ile 


gin 


asn 


ile 


242/81 
















272/91 
















TTG TAT 


CTC TTC 


TCC 


GTG 


CCA 


CCC 


CCT 


AGG 


TGT AGT 


GTT 


TTC GAG 


TAC 


CGG 


CAG 


ATC 


CCA 


leu tyr 


leu phe 


ser 


val 


pro 


pro 


pro 


arg 


cys ser 


val 


phe glu 


tyr 


arg 


gin 


ile 


pro 



302/101 

GGT TCA CCA GGT CTC ACC AGA TC 
gly ser pro gly leu thr arg) 



SEQ ID Res- \2%-lS2. 
FIGURE 7B 



^-^GTC GGC AAC TTG 

icCSl iDrto.iSM-^^ 31 9 1 y asn leu 

63/21 

CGG CTC CAT CGG 
ylQ iDto '3|( ar g leu his arg 
123/41 

GCC GGT GAT ATA 
ala gly asp ile 
183/61 

CCG CGG TGC TCG 
pro arg cys ser 
243/81 

TGT ATC TCT TCT 
cys ile ser ser 
303/101 

GTT CAC CAG GTC 
val his gin val 



33/11 

GCC GCT GAG GTC GGC TTG ATC CCT 
ala ala glu val gly leu ile pro 

93/31 

CTT TGC TGG TAG CGG TTC GGC GGG 
leu cys trp) AMEL(arg phe gly gly 

S^tiDito. \3(x> 153/51 
TTG GGT CAG ACG GGT ATG GCG GCG 
leu gly gin thr gly met ala ala 

213/71 

AGC CAG GCT TAC GAC CAG GGA ATT 
ser gin ala tyr asp gin gly ile 

273/91 

CCG TGC CAC CCC CTA GGT GTA GTG 
pro cys his pro leu gly val val 

TCA CCA GAT C 
ser pro asp) 



GGG CCG AGG CGG GTC AGC CAA TAG 
gly pro arq arg val ser gin} AMB 

AAG CTA GCG GCG ACG TTG TCG GTG 
lys leu ala ala thr leu ser val 

GCT GAG GTG ATC TGC GAC ACG CCG 
ala glu val ile cys asp thr pro 

TCG AAA ATG TTA TTC AGA ACA TCT 
ser lys met leu phe arg thr ser 

TTT TCG AGT ACC GGC AGA TCC CAG 
phe ser ser thr gly arg ser gin 



SEQ ID Kc©. t33-/3Co 



FIGURE 7C 



22/185 



3Cl> JUno.^ CTT TGC 


GTG 


ATG 


TCC 


AAT 


GGC 


GAA 


AAC 


GAC 


31/11 
GCC TTG 


TCA 


TCG 


CAA 


TCG 


TCA 


GCA 


CCG 


GCC 


^'D^ia«->( leu c ^ s 


val 


met 


ser 


asn 


gly 


glu 


asn 


asp 


ala leu 


ser 


ser 


gin 


ser 


ser 


ala 


pro 


ala^ 


61/21 


















91/31 


















TAG TTT 


TCG 


CGA 


TGA 


CGC 


TCG 


TTC 


TGA 


CCG 


GAC TTG 


TGA 


ACG 


GGT 


TTC 


GGG 


TCG 


AGG 


CCG 


AMB phe 


ser 


arg 


OPA 


arg 


ser 


phe 


OPA 


pro 


asp leu 


OPAjfchr 


gly 


phe 




ser 


arg 


pro 


121/41 


















151/51 




Z 1Z 








AGC GAA 


CCG 


TCG 


ATT 


CCA 


TGG 


GTG 


TCG 


ACG 


CAT TCG 


TGG 


TCA 


AGG 


CCG 


GCG 


CGG 


CAG 


GAC 


ser glu 


pro 


ser 


ile 


pro 


trp 


val 


ser 


thr 


his ser 


trp 


ser 


arg 


pro 


ala 


arg 


gin 


asp 


181/61 


















211/71 


















CGT TCC 


TGG 


GTT 


CGA 


CAC 


CAT 


TCG 


CCC 


AAA 


TCG ACC 


TGC 


CCC 


AGG 


TTG 


CTC 


GTG 


CGC 


CTG 


arg ser 


trp 


val 


arg 


his 


his 


ser 


pro 


lys 


ser thr 


cys 


pro 


arg 


leu 


leu 


val 


arg 


leu 


241/81 


















271/91 


















GCG TCT 


TGG 


CTG 


CCG 


CCC 


CAC 


TAG 


CGA 


CTG 


CGC CGT 


CGA 


CGA 


TCC 


GGC 


AGG 


GCA 


CGT 


CAG 


ala ser 


trp 


leu 


pro 


pro 


his) 


AMBfarg 


leu 


arg arg 


arg 


arg 


ser 


gly 


arg 


ala 


arg 


gin 


301/101 










sea 


ZTS 


331/111 


















CGC GAA 


ACG 


TCA 


CCG 


CGT 


TCG 


GGG 


CAC 


CAG 


AGC ACG 


GAC 


CCG 


GCA 


TGC 


CGC 


GGG 


TCT 


CGG 


arg glu 


thr 


ser 


pro 


arg 


ser 


gly 


his 


gin 


ser thr 


asp 


pro 


ala 


cys 


arg 


gly 


ser 


arg 


361/121 


















391/131 


















ACG GTC 


GGG 


CGC 


CAT 


CGA 


CGC 


CGG 


ACG 


AGG 


TCG CGG 


TGT 


CGA 


GCA 


CGC 


TGG 


GCC 


GAA 


ACC 


thr val 


gly 


arg 


his 


arg 


arg 


arg 


thr 


arg 


ser arg 


cys 


arg 


ala 


arg 


trp 


ala 


glu 


thr 



421/141 

TCG GCG ACG ATC 
ser ala thr ile^ 

1 SEQ ID ja^. 131-^,272-^73 
FIGURE 8A 





















32/11 


















V "- J >TTT GCG 


TGA 


TGT 


CCA 


ATG 


GCG 


AAA 


ACG 


ACG 


CCT TGT 


CAT 


CGC 


AAT 


CGT 


CAG 


CAC 


CGG 


CCT 


phe ala 


OPA 


(cys 


pro 


met 


ala 


lys 


thr 


thr 


pro cys 


his 


arg 


asn 


arg 


gin 


his 


arg 


pro 


62/21 


S££> Ho< /MO 








92/31 


















AGT TTT 


CGC 


GAT 


GAC 


GCT 


CGT 


TCT 


GAC 


CGG 


ACT TGT 


GAA 


CGG 


GTT 


TCG 


GGT 


CGA 


GGC 


CGA 


ser phe 


arg 


asp 


asp 


ala 


arg 


ser 


asp 


arg 


thr cys 


glu 


arg 


val 


ser 


gly 


arg 


gly 


arg 


122/41 


















152/51 


















GCG AAC 


CGT 


CGA 


TTC 


CAT 


GGG 


TGT 


CGA 


CGC 


ATT CGT 


GGT 


CAA 


GGC 


CGG 


CGC 


GGC 


AGG 


ACC 


ala asn 


arg 


arg 


phe 


his 


gly 


cys 


arg 


arg 


ile arg 


gly 


gin 


gly 


arg 


arg 


gly 


arg 


thr 


182/61 


















212/71 


















GTT CCT 


GGG 


TTC 


GAC 


ACC 


ATT 


CGC 


CCA 


AAT 


CGA CCT 


GCC 


CCA 


GGT 


TGC 


TCG 


TGC 


GCC 


TGG 


val pro 


gly 


phe 


asp 


thr 


ile 


arg 


pro 


asn 


arg pro 


ala 


pro 


gly 


cys 


ser 


cys 


ala 


trp 


242/81 


















272/91 


















CGT CTT 


GGC 


TGC 


CGC 


CCC 


ACT 


AGC 


GAC 


TGC 


GCC GTC 


GAC 


GAT 


CCG 


GCA 


GGG 


CAC 


GTC 


AGC 


arg leu 


gly 


cys 


arg 


pro 


thr 


ser 


asp 


cys 


ala val 


asp 


asp 


pro 


ala 


gly 


his 


val 


ser 


302/101 


















332/111 


















GCG AAA 


CGT 


CAC 


CGC 


GTT 


CGG 


GGC 


ACC 


AGA 


GCA CGG 


ACC 


CGG 


CAT 


GCC 


GCG 


GGT 


CTC 


GGA 


ala lys 


arg 


his 


arg 


val 


arg 


gly 


thr 


arg 


ala arg 


thr 


arg 


his 


ala 


ala 


gly 


leu 


gly 


362/121 


















392/131 


















CGG TCG 


GGC 


GCC 


ATC 


GAC 


GCC 


GGA 


CGA 


GGT 


CGC GGT 


GTC 


GAG 


CAC 


GCT 


GGG 


CCG 


AAA 


CCT 


arg ser 


gly 


ala 


ile 


asp 


ala 


gly 


arg 


gly 


arg gly 


val 


glu 


his 


ala 


gly 


pro 


lys 


pro 



422/141 

CGG CGA CGA TC 
arg arg arg) 



FIGURE 8B 



23/185 





















33/11 


















^^TTG CGT 


GAT 


GTC 


CAA 


TGG 


CGA 


AAA 


CGA 


CGC 


CTT GTC 


ATC 


GCA 


ATC 


GTC 


AGC 


ACC 


GGC 


CTA 


Iftrt>*a ; Jleu arg 
'^63/21 


asp 


val 


gin 


trp 


arg 


lys 


arg 


arg 


leu val 


ile 


ala 


ile 


val 


ser 


thr 


gly 


leu 


















93/31 


















GTT TTC 


GCG 


ATG 


ACG 


CTC 


GTT 


CTG 


ACC 


GGA 


CTT GTG 


AAC 


GGG 


TTT 


CGG 


GTC 


GAG 


GCC 


GAG 


val phe 


ala 


met 


thr 


leu 


val 


leu 


thr 


gly 


leu val 


asn 


gly 


phe 


arg 


val 


glu 


ala 


glu 


123/41 


















153/51 


















CGA ACC 


GTC 


GAT 


TCC 


ATG 


GGT 


GTC 


GAC 


GCA 


TTC GTG 


GTC 


AAG 


GCC 


GGC 


GCG 


GCA 


GGA 


CCG 


arg thr 


val 


asp 


ser 


met 


gly 


val 


asp 


ala 


phe val 


val 


lys 


ala 


gly 


ala 


ala 


gly 


pro 


183/61 


















213/71 


















TTC CTG 


GGT 


TCG 


ACA 


CCA 


TTC 


GCC 


CAA 


ATC 


GAC CTG 


CCC 


CAG 


GTT 


GCT 


CGT 


GCG 


CCT 


GGC 


phe leu 


gly 


ser 


thr 


pro 


phe 


ala 


gin 


ile 


asp leu 


pro 


gin 


val 


ala 


arg 


ala 


pro 


gly 


243/81 
















273/91 


















GTC TTG 


GCT 


GCC 


GCC 


CCA 


CTA 


GCG 


ACT 


GCG 


CCG TCG 


ACG 


ATC 


CGG 


CAG 


GGC 


ACG 


TCA 


GCG 


val leu 


ala 


ala 


ala 


pro 


leu 


ala 


thr 


ala 


pro ser 


thr 


ile 


arg 


gin 


gly 


thr 


ser 


ala 


303/101 


















333/111 


















CGA AAC 


GTC 


ACC 


GCG 


TTC 


GGG 


GCA 


CCA 


GAG 


CAC GGA 


CCC 


GGC 


ATG 


CCG 


CGG 


GTC 


TCG 


GAC 


arg asn 


val 


thr 


ala 


phe 


gly 


ala 


pro 


glu 


his gly 


pro 


gly 


met 


pro 


arg 


val 


ser 


asp 


363/121 


















393/131 


















GGT CGG 


GCG 


CCA 


TCG 


ACG 


CCG 


GAC 


GAG 


GTC 


GCG GTG 


TCG 


AGC 


ACG 


CTG 


GGC 


CGA 


AAC 


CTC 


gly arg 


ala 


pro 


ser 


thr 


pro 


asp 


glu 


val 


ala val 


ser 


ser 


thr 


leu 


gly 


arg 


asn 


leu 



423/141 

GGC GAC GAT C 

gly asp asp) 

SEQ ID : Ax>5. /V/-/V£ 
FIGURE 8C 



part of the nucleotide sequence of seq8A 



1/1 


















31/11 


















£<£EDrtO.W3*; AG GTT 


GCT 


CGT 


GCG 


CCT 


GGC 


GTC 


TTG 


GCT 


GCC GCC 


CCA 


CTA 


GCG 


ACT 


GCG 


CCG 


TCG 


ACG 


^g/gln val 
i& ID Ho./v</ bl/21 


ala 


arg 


ala 


pro 


gly 


val 


leu 


ala 


ala ala 


pro 


leu 


ala 


thr 


ala 


pro 


ser 


thr 


















91/31 


















ATC CGG 


CAG 


GGC 


ACG 


TCA 


GCG 


CGA 


AAC 


GTC 


ACC GCG 


TTC 


GGG 


GCA 


CCA 


GAG 


CAC 


GGA 


CCC 


ile arg 


gin 


gly 


thr 


ser 


ala 


arg 


asn 


val 


thr ala 


phe 


gly 


ala 


pro 


glu 


his 


gly 


pro 


121/41 
















151/51 


















GGC ATG 


CCG 


CGG 


GTC 


TCG 


GAC 


GGT 


CGG 


GCG 


CCA TCG 


ACG 


CCG 


GAC 


GAG 


GTC 


GCG 


GTG 


TCG 


gly met 


pro 


arg 


val 


ser 


asp 


gly 


arg 


ala 


pro ser 


thr 


pro 


asp 


glu 


val 


ala 


val 


ser 


181/61 




































AGC ACG 


CTG 


GGC 


CGA 


AAC 


CTC 


GGC 


GAC 


GAT 


C 


















ser thr 


leu 


gly 


arg 


asn 


leu 


gly 


asp 


asp) 





















SEQ ID: 

FIGURE 8A' 



24/185 



era jl> ko W6 ^ i GG Gt CGC ctg gcg tct tgg ctg 3i/ii cc ^ ^ 

■« ibfc. ia 6 { q /2 \ &U l6U Val arg leU 913 Ser tr ? leu Pjo pro his) AMB^arg leu arg arg arg 

ler olv ^ T ^ TCA TCGS ^ ^C^g'^C ACG GAC CCG 

121/ A 9 9 ^ U ^ ill/H 9 gly hiS gln ser thr as P 

GCA TGC CGC GGG TCT CGG ACG GTC GGG CGC CAT CGA CGC CGG ACG AGG TCG CGG TGT CGA 
alleys arg gly se r arg thr val gly arg his arg arg . arg thr arg ser ^ ^ JGA 

GCA CGC TGG GCC GAA ACC TCG GCG ACG ATC 
ala arg trp ala glu thr ser ala thr ile) 

SEQ ID ^„ IQ^WJ 
FIGURE 8B ? 



Seq8C 

1/1 31/11 

to ^O./^-^ CCA GGT TGC TCG TGC GCC TGG CGT CTT GGC TGC CGC CCC ACT AGC GAC TGC GCC ,GTC GAC 

/'<N ^ ii ( G <£f° gly CyS Ser cys ala tr ? ar 9 leu ^ly cys arg pro thr ser asp cys ala val asp 

61/21 91/31 

GAT CCG GCA GGG CAC GTC AGC GCG AAA CGT CAC CGC GTT CGG GGC ACC AGA GCA CGG ACC 

asp pro ala gly his val ser ala lys arg his arg val arg gly thr arg ala arg thr 

121/41 151/51 

CGG CAT GCC GCG GGT CTC GGA CGG TCG GGC GCC ATC GAC GCC GGA CGA GGT CGC GGT GTC 

arg his ala ala gly leu gly arg ser gly ala ile asp ala gly arg gly arg gly val 
181/61 

GAG CAC GCT GGG CCG AAA CCT CGG CGA CGA TC 
glu his ala gly pro lys pro arg arg arg 1 ) 

SEQ ID K^q. m~l<4Q 



FIGURE 8C f 
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sequence Rv2563 predicted by Cole 
seq8A' 

atg 
■ met 
121/4T 

it> Ko (60 ctt ttt g C g g C t ttg cgt gat gtc caa 

r &> li> Uo./6>/ Tl eu P he ala ala leu ar 9 as P val 9 ln 
181/61 

age acc ggc eta gtt ttc gcg atg acg 
ser thr gly leu val phe ala met thr 
. 241/81 

gtc gag gec gag cga acc gtc gat tec 
val glu ala glu arg thr val asp ser 
301/101 

gcg gca gga ccg ttc ctg ggt teg aca 
ala ala gly pro phe leu gly ser thr 
361/121 

cgt gcg cct ggc gtc ttg get gee gec 
arg ala pro gly val leu ala ala ala 
421/141 

ggc acg tea gcg cga aac gtc acc gcg 
gly thr ser ala arg asn val thr ala 
. 481/161 

egg gtc teg gac ggt egg gcg cca teg 
arg val ser asp gly arg ala pro ser 
541/181 

' ' ggc cga aac etc ggc gac gat ctg caa 
gly arg asn leu gly asp asp leu gin 
601/201 

gtg ccc gag tea acc gcg ctg gca aag 
val pro glu ser thr ala leu ala lys 
661/221 

cag cag ttg gca tac aac gga cag ccg 
gin gin leu ala tyr asn gly gin pro 
721/241 

cga cag etc ccg gac ggc tat cag acc 
arg gin leu pro asp gly tyr gin thr 
781/261 

cgc ccg ttg aag gtc gcg gtg gat gcg 
arg pro leu lys val ala val asp ala 
841/281 

gcg gcg ttg ate gtc ggc teg gtg gtc 
ala ala leu ile val gly ser val val 
901/301 

gcg gtg ttc aag gcg ate ggc gtg ccg 
ala val phe lys ala ile gly val pro 
961/321 

gcg gtc gtc gtc gcg ctg etc gcg gcg 
ala val val val ala leu leu ala ala 
1021/341 

ccg ttg ttc ccg atg act gtc gtg gta 
pro leu phe pro met thr val val val 
1081/361 

gcg act gtg ate ggt ctg ctg gee age 
ala thr val ile gly leu leu ala ser 
1141/381 

ccg gca eta gcg ttc gga ggt ccc tag 
pro ala leu ala phe gly gly pro AM^ 



et al. (Nature 393:537-544) and containing 



151/51 

tgg cga aaa cga cgc ctt gtc ate gca ate gtc 
trp arg lys arg arg leu val ile ala ile val 
211/71 

etc gtt ctg acc gga ctt gtg aac ggg ttt egg 
leu val leu thr gly leu val asn gly phe arg 
271/91 

atg ggt gtc gac gca ttc gtg gtc aag gee ggc 
met gly val asp ala phe val val lys ala gly 
331/111 

cca ttc gec caa ate gac ctg ccc cag gtt get 
pro phe ala gin ile asp leu pro gin val ala 
391/131 

cca eta gcg act gcg ccg teg acg ate egg cag 
pro leu ala thr ala pro ser thr ile arg gin 
451/151 

ttc ggg gca cca gag cac gga ccc ggc atg ccg 
phe gly ala pro glu his gly pro gly met pro 
511/171 

acg ccg gac gag gtc gcg gtg teg age acg ctg 
thr pro asp glu val ala val ser ser thr leu 
571/191 

gtg ggt gcg cgc act ttg egg ate gtc ggc ate 
val gly ala arg thr leu arg ile val gly ile 
631/211 

att ccc aac ate ttc ctg acc acc gaa ggc eta 
ile pro asn ile phe leu thr thr glu gly leu 
691/231 

aca ate agt teg ate ggg ate gac ggg atg ccc 
thr ile ser ser ile gly ile asp gly met pro 
751/251 

gtc aat cga gcg gat get gtc age gat ctg atg 
val asn arg ala asp ala val ser asp leu met 
811/271 

ate acg gtt gtg gcg gtc ttg ctg tgg ate gtt 
ile thr val val ala val leu leu trp ile val 
871/291 

tac etc tct gcg ttg gag egg ctg cgt gac ttt 
tyr leu ser ala leu glu arg leu arg asp phe 
931/311 

acg cgc teg att ctg gee ggg ctg gcg ctg cag 
thr arg ser ile leu ala gly leu ala leu gin 
991/331 

gtg gtt ggc ggc ate ctt teg ctg ctg ttg gcg 
val val gly gly ile leu ser leu leu leu ala 
1051/351 

ccc ctg agt gee ttc gtg gcg eta ccg gcg ate 
pro leu ser ala phe val ala leu pro ala ile 
1111/371 

gtc gca gga ctg egg cgc gtg gtg gcg ate gat 
val ala gly leu arg arg val val ala ile asp 



SEQ ID &oZ> i*oQ-}6>i 
FIGURE 8D 
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ORF predicted by Cole et al . (Nature 393:537-544) and containing Rv2563 

I J^ 1/1 31/11 

"°^^#tag gtt tea aga agg cct gtg cag gtt tec gca gec tgg gec gcg gcg cca ccg aag age 

AMK(val ser arg arg pro val gin val ser ala ala trp ala ala ala pro pro lys ser 
>£&ID Woit53-gY7 2 \ 91/31 

ccg ccg aaa tgg get aat egg gtt cgc ttg get cga teg ccg atg ate teg ace gee acg 
pro pro lys trp ala asn arg val arg leu ala arg ser pro met ile ser thr ala thr 
121/41 151/51 

acc gac ccc etc ace teg gtc gaa cct egg cga acc aac gcg gca acg cca gee cat gat 
thr asp pro leu thr ser val glu pro arg arg thr asn ala ala thr pro ala his asp 
181/61 211/71 

cat ttg att ggg tec acg gaa gca ggt age ttc cgt cgc atg ctt ttt gcg get ttg cgt 
his leu ile gly ser thr glu ala gly ser phe arg arg met leu phe ala ala leu arg 
241/81 271/91 

gat gtc caa tgg cga aaa cga cgc ctt gtc ate gca ate gtc age acc ggc eta gtt ttc 
asp val gin trp arg lys arg arg leu val ile ala ile val ser thr gly leu val phe 
301/101 331/111 

gcg atg acg etc gtt ctg acc gga ctt gtg aac ggg ttt egg gtc gag gee gag cga acc 
ala met thr leu val leu thr gly leu val asn gly phe arg val glu ala glu arg thr 
361/121 391/131 

gtc gat tec atg ggt gtc gac gca ttc gtg gtc aag gee ggc gcg gca gga ccg ttc ctg 
val asp ser met gly val asp ala phe val val lys ala gly ala ala gly pro phe leu 
421/141 451/151 

ggt teg aca cca ttc gee caa ate gac ctg ccc cag gtt get cgt gcg cct ggc gtc ttg 
gly ser thr pro phe ala gin ile asp leu pro gin val ala arg ala pro gly val leu 
481/161 511/171 

get gec gec cca eta gcg act gcg ccg teg acg ate egg cag ggc acg tea gcg cga aac 
ala ala ala pro leu ala thr ala pro ser thr ile arg gin gly thr ser ala arg asn 
541/181 571/191 

gtc acc gcg ttc ggg gca cca gag cac gga ccc ggc atg ccg egg gtc teg gac ggt egg 
1 - ■ val thr ala phe gly ala pro glu .his igly pro gly met pro arg val ser asp. -gly arg .,. 

,601/201 631/211 

gcg cca teg acg ccg gac gag gtc gcg gtg teg age acg ctg ggc cga aac etc ggc gac 
ala pro ser thr pro asp glu val ala val ser ser thr leu gly arg asn leu gly asp 
661/221 691/231 

gat ctg caa gtg ggt gcg cgc act ttg egg ate gtc ggc ate gtg ccc gag tea acc gcg 
asp leu gin val gly ala arg thr leu arg ile val gly ile val pro glu ser thr ala 
721/241 751/251 

ctg gca aag att ccc aac ate ttc ctg acc acc gaa ggc eta cag cag ttg gca tac aac 
leu ala lys ile pro asn ile phe leu thr thr glu gly leu gin gin leu ala tyr asn 
781/261 811/271 

gga cag ccg aca ate agt teg ate ggg ate gac ggg atg ccc cga cag etc ccg gac ggc 
gly gin pro thr ile ser ser ile gly ile asp gly met pro arg gin leu pro asp gly 
841/281 871/291 

tat cag acc gtc aat cga gcg gat get gtc age gat ctg atg cgc ccg ttg aag gtc gcg 
tyr gin thr val asn arg ala asp ala val ser asp leu met arg pro leu lys val ala 
901/301 931/311 

gtg gat gcg ate acg gtt gtg gcg gtc ttg ctg tgg ate gtt gcg gcg ttg ate gtc ggc 
val asp ala ile thr val val ala val leu leu trp ile val ala ala leu ile val gly 
961/321 9S1/331 

teg gtg gtc tac etc tct gcg ttg gag egg ctg cgt gac ttt gcg gtg ttc aag gcg ate 
ser val val tyr leu ser ala leu glu arg leu arg asp phe ala val phe lys ala ile 
1021/341 1051/351 

ggc gtg ccg acg cgc teg att ctg gee ggg ctg gcg ctg cag gcg gtc gtc gtc gcg ctg 
gly val pro thr arg ser ile leu ala gly leu ala leu gin ala val val val ala leu 
1081/361 1111/371 

etc gcg gcg gtg gtt ggc ggc ate ctt teg ctg ctg ttg gcg ccg ttg ttc ccg atg act 
leu ala ala val val gly gly ile leu ser leu leu leu ala pro leu phe pro met thr 
1141/381 1171/391 

gtc gtg gta ccc ctg agt gee ttc gtg gcg eta ccg gcg ate gcg act gtg ate ggt ctg 
val val val pro leu ser ala phe val ala leu pro ala ile ala thr val ile gly leu 
1201/401 1231/411 

ctg gee age gtc gca gga ctg egg cgc gtg gtg gcg ate gat ccg gca eta gcg ttc gga 
leu ala ser val ala gly leu arg arg val val ala ile asp pro ala leu ala phe gly 
1261/421 
ggt ccc tag 
gly pro^AMB 

SEQ ID ':HC&. foZ-ieS 
FIGURL BE 
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sequence of 
more than 77 
1/1 



Rv0072 predicted by Cole et 
% similarity with Seq8D* 



al. (Nature 393:537-544). and exhibit!: 





ttc 


gcg 


gee 


ctg 


cgt 


>f<s> ,0 ' u,€>2 * 61/21 


phe 


ala 


ala 


leu 


arg 














ate age 


ace 


ggg 


ctg 


ate 


ttc 


ile ser 


thr 


gly 


leu 


ile 


phe 


121/41 












egg gtg 


gag 


gee 


egg 


cac 


ace 


arg val 


glu 


ala 


arg 


his 


thr 


181/61 












ggc get 


get 


gga 


cct 


ttt 


ctg 


gly ala 


ala 


gly 


pro 


phe 


leu 


241/81 












gee get 


gaa 


ccc 


ggt 


gtc 


atg 


ala ala 


glu 


pro 


gly 


val 


met 


301/101 












gaa ggc 


acg 


teg 


acg 


cga 


aac 


glu gly 


thr 


ser 


thr 


arg 


asn 


361/121 












cca egg 


gt c 


tea 


gag 


ggt 


egg 


pro arg 


val 


ser 


glu 


gly 


arg 


421/141 












atg ggc 


cga 


cac 


ctc 


ggt 


gac 


met gly 


arg 


his 


leu 


gly 


asp 


481/161 












* ■■ a tt gtg" 


ccg* 


• j aat' 


tec 


■ ace 


gcg 


ile val 


pro 


asn 


ser 


thr 


ala 


541/181 












tta cag 


aaa 


ttg 


gcg 


tac 


aac 


leu gin 


lys 


leu 


ala 


tyr 


asn 


601/201 












ccc cga 


cag 


ctg 


ccg 


gag 


ggt 


pro arg 


gin 


leu 


pro 


glu 


gly 


661/221 












gtg cgc 


cca 


ttg 


aag 


gtc 


gca 


val arg 


pro 


leu 


lys 


val 


ala 


721/241 












gtg gcg 


gtg 


ctg 


ate 


gtc 


ggc 


val ala 


val 


leu 


ile 


val 


gly 


781/261 












ttc gcg 


gtg 


ttc 


aag 


gcg 


att 


phe ala 


val 


phe 


lys 


ala 


ile 


841/281 












cag gcg 


ctg 


gtc 


att 


gcg 


ttg 


gin ala 


leu 


val 


ile 


ala 


leu 


901/301 












gca cca 


ctg 


ttt 


ccg 


atg 


att 


ala pro 


leu 


phe 


pro 


met 


ile 


961/321 












gec gcg 


ate 


gtc 


ate 


ggt 


ctg 


ala ala 


ile 


val 


ile 


gly 


leu 


1021/341 










gat ccc 


gcg 


cag 


gcg 


ttc 


gga 


asp pro 


ala 


gin 


ala 


phe 


gly 



31/11 
tgg aga 
trp arg 
91/31 
ctt gtt 
leu val 
151/51 
atg ggt 
met gly 
211/71 
ccg ttt 
pro phe 
271/91 
ccg ttg 
pro leu 
331/111 
ttc ggc 
phe gly 
391/131 
aaa ccg 
lys pro 
451/151 
gtc ggc 
val gly 
511/171 
ate ccc 
ile pro 
571/191 
aat ate 
asn ile 
631/211 
ttc gat 
phe asp 
691/231 
ate teg 
ile ser 
751/251 
tac ctt 
tyr leu 
811/271 
acg cgc 
thr arg 
871/291 
gtg gtg 
val val 
931/311 
ccc gtc 
pro val 
991/331 
gtt gec 
val ala 



aag cgc cgc ctg gtc ate acg ate 
lys arg arg leu val ile thr ile. 

ttg acc gga ctc gcg aac ggc ttc 
leu thr gly leu ala asn gly phe 

gtc gat gta ttc gtc gtc aga tec 
val asp val phe val val arg ser 

ccc gat gtt gac ctg gec cga gtg 
pro asp val asp leu ala arg val 

ggc age gtg ggg acg ate atg aaa 
gly ser val gly thr ile met lys 

gcg ccc gag cac gga cct ggc atg 
ala pro glu his gly pro gly met 

gac gaa gtc gcg gca teg age acg 
asp glu val ala ala ser ser thr 

gcg cgc aga ttg egg gtc gtt ggc 
ala arg arg leu arg val val gly 

■aat gtc ttc ctc acg acc gag ggc ■ 
asn val phe leu thr thr glu gly 

acg tec ate ggg ate ata ggt atg 
thr ser ile gly ile ile gly met 

egg gtg ggc get gtc aat gat ttg 
arg val gly ala val asn asp leu 

ate gtg get gtt ttg ctg tgg att 
ile val ala val leu leu trp ile 

teg get ctt gag egg eta cgt gac 
ser ala leu glu arg leu arg asp 

teg att atg gee ggg ctc gca tta 
ser ile met ala gly leu ala leu 

ggc gtc gtc ctg gcg cag gtg ttg 
gly val val leu ala gin val leu 

ggt get tac ctg gcg eta ccg gtg 
gly ala tyr leu ala leu pro val 

gga ttg aag cgc gtg gtg acg gtc 
gly leu lys arg val val thr val 



SEQ ID 
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Seq8H : 


ORF 


predicted. by Cole et al 


. (Nature 393:53" 


'-544 


1/1 


















31/11 






tag cct 


ctg 


gga 


atg 


etc 


ttc 


gcg 


gec 


ctg 


cgt gac 


atg 


caa 


AMB (pro 


leu 


gly 


met 


leu 


phe 


ala 


ala 


leu 


arg asp 


met 


gin 


61/21 


















91/31 




gtc ate 


acg 


ate 


ate 


age 


acc 


ggg 


ctg 


ate 


ttc ggg 


atg 


acg 


val ile 


thr 


ile 


ile 


ser 


thr 


gly 


leu 


ile 


phe gly 


met 


thr 


121/41 


















151/51 






gcg aac 


ggc 


ttc 


egg 


gtg 


gag 


gee 


egg 


cac 


acc gtc 


gat 


tec 


ala asn 


gly 


phe 


arg 


val 


glu 


ala 


arg 


his 


thr val 


asp 


ser i 


181/61 


















211/71 




gtc gtc 


aga 


tec 


ggc 


get 


get 


gga 


cct 


ttt 


ctg ggt 


tea 


ata 


val val 


arg 


ser 


gly 


ala 


ala 


gly 


pro 


phe 


leu gly 


ser 


ile 


241/81 


















271/91 






ctg gec 


cga 


gtg 


gec 


get 


gaa 


ccc 


ggt 


gtc 


atg gec 


gcg 


gec 


leu ala 


arg 


val 


ala 


ala 


glu 


pro 


gly 


val 


met ala 


ala 


ala 


301/101 


















331/111 






acg ate 


atg 


aaa 


gaa 


ggc 


acg 


teg 


acg 


cga 


aac gtc 


acg 


gtc 


thr ile 


met 


lys 


glu 


gly 


thr 


ser 


thr 


arg 


asn val 


thr 


val 


361/121 


















391/131 






gga cct 


ggc 


atg 


cca 


egg 


gtc 


tea 


gag 


ggt 


egg tea 


ccg 


teg 


gly pro 


gly 


met 


pro 


arg 


val 


ser 


glu 


gly 


arg ser 


pro 


ser 


421/141 


















451/151 






gca teg 


age 


acg 


atg 


ggc 


cga 


cac 


etc 


ggt 


gac act 


gtc 


gag 


ala ser 


ser 


thr 


met 


gly 


arg 


his 


leu 


gly 


asp thr 


val 


glu 


481/161 


















511/171 




egg gtc 


gtt 


ggc 


att 


gtg 


ccg 


aat 


tec 


acc 


gcg ctg 


gec 


aag 


arg val 


val 


gly 


ile 


val 


pro 


asn 


ser 


thr 


ala leu 


ala 


lys 


541/181 


















571/191 




acg acc 


gag 


ggc 


tta 


cag 


aaa 


ttg 


gcg 


tac 


aac ggg 


cag 


ccg 


thr thr 


glu 


gly 


leu 


gin 


lys 


leu 


ala 


tyr 


asn gly 


gin 


pro 


601/201 


















631/211 






ate ata 


ggt 


atg 


ccc 


cga 


cag 


ctg 


ccg 


gag 


ggt tac 


cag 


act 


ile ile 


gly 


met 


pro 


arg 


gin 


leu 


pro 


glu 


gly tyr 


gin 


thr 


661/221 


















691/231 






gtc aat 


gat 


ttg 


gtg 


cgc 


cca 


ttg 


aag 


gtc 


gca gtg 


aat 


teg 


val asn 


asp 


leu 


val 


arg 


pro 


leu 


lys 


val 


ala val 


asn 


ser 


721/241 


















751/251 






ttg ctg 


tgg 


att 


gtg 


gcg 


gtg 


ctg 


ate 


gtc 


ggc teg 


gtg 


gtg 


leu leu 


trp 


ile 


val 


ala 


val 


leu 


ile 


val 


gly ser 


val 


val 


781/261 


















811/271 


t 




egg eta 


cgt 


gac 


ttc 


gcg 


gtg 


ttc 


aag 


gcg 


att ggc 


acg 


cca 


arg leu 


arg 


asp 


phe 


ala 


val 


phe 


lys 


ala 


ile gly 


thr 


pro 


841/281 


















871/291 






ggg etc 


gca 


tta 


cag 


gcg 


ctg 


gtc 


att 


gcg 


ttg ctt 


gcg 


gcg 


gly leu 


ala 


leu 


gin 


ala 


leu 


val 


ile 


ala 


leu leu 


ala 


ala 


901/301 


















931/311 






gcg cag 


gtg 


ttg 


gca 


cca 


ctg 


ttt 


ccg 


atg 


att gtc 


gcg 


gta 


ala gin 


val 


leu 


ala 


pro 


leu 


phe 


pro 


met 


ile val 


ala 


val 


961/321 


















991/331 






gcg eta 


ccg 


gtg 


gec 


gcg 


ate 


gtc 


ate 


ggt 


ctg ttc 


get 


agt 


ala leu 


pro 


val 


ala 


ala 


ile 


val 


ile 


gly 


leu phe 


ala 


ser 


1021/341 
















1051/351 




gtg gtg 


acg 


gtc 


gat 


ccc 


gcg 


cag 


gcg 


ttc 


gga ggt 


ccc 


tag 


val val 


thr 


val 


asp 


pro 


ala 


gin 


ala 


phe 


gly gly pro) AMB 



-544) and containing seq8G 



tgg 
trp 


aga 
arg 


aag 
lys 


cgc 
arg 


cgc 
arg 


ctg 
leu 


ctt 
leu 


gtt 
val 


ttg 
leu 


acc 
thr 


gga 
gly 


etc 
leu 


atg 
met 


ggt 
gly 


gtc 
val 


gat 
asp 


gta 
val 


ttc 
phe 


ccg 
pro 


ttt 
phe 


ccc 
pro 


gat 
asp 


gtt 
val 


gac 
asp 


ccg 
pro 


ttg 
leu 


ggc 
gly 


age 
ser 


gtg 
val 


ggg 

gly 


ttc 
phe 


ggc 
gly 


gcg 
ala 


ccc 
pro 


gag 
glu 


cac 
his 


aaa 
lys 


ccg 
pro 


gac 
asp 


gaa 
glu 


gtc 
val 


gcg 
ala 


gtc 
val 


ggc 
gly 


gcg 
ala 


cgc 
arg 


aga 
a rg 


ttg 
leu 


ate 
ile 


ccc 
pro 


•aat- 
asn 


gtc 
val 


ttc- 
phe 


etc 
leu 


aat 
asn 


ate 
ile 


acg 
thr 


tec 
ser 


ate 
ile 


ggg 

gly 


ttc 
phe 


gat 
asp 


egg 
arg 


gtg 
val 


ggc 
gly 


get 
ala 


ate 
ile 


teg 
ser 


ate 
ile 


gtg 
val 


get 
ala 


gtt 
val 


tac 
tyr 


ctt 
leu 


teg 
ser 


get 
ala 


ctt 
leu 


gag 
glu 


acg 
thr 


cgc 
arg 


teg 
ser 


att 
ile 


atg 
met 


gec 
ala 


gtg 
val 


gtg 
val 


ggc 
gly 


gtc 
val 


gtc 
val 


ctg 
leu 


ccc 
pro 


gtc 
val 


ggt 
gly 


get 
ala 


tac 
tyr 


ctg 
leu 


gtt 
val 


gec 
ala 


gga 
gly 


ttg 
leu 


aag 
lys 


cgc 
arg 
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FIGURE 8Gj 
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31/11 



U5> 


CGA GGC 


CGA 


GCG 


AAC 


CGT 


CGA 


TTC 


CAT 


GGG 


TGT CGA 


CGC 


ATT 


CGT 


GGT 


CAA 


GGC 


CGG 


CGC 


iT\ il/s jaQ 


61/21 


arc 


ala 


asn 


arg 


a rq 


phe 


his 


Ql V 


cys arg 
91/31 


a r a 


ile 


a r a 


crl v 


aln 


nl v 


a rg 


a rq 




GGC AGG 


ACC 


GTT 


CCT 


GGG 


TTC 


GAC 


ACC 


ATT 


CGC CCA 


AAT 


CGA 


CCT 


GCC 


CCA 


GGT 


TGC 


TCG 




gly arg 


thr 


val 


pro 


gly 


phe 


asp 


thr 


ile 


arg pro 


asn 


arg 


pro 


ala 


pro 


gly 


cys 


ser 




121/41 


















151/51 




















TGC GCC 


TGG 


CGT 


CTT 


GGC 


TGC 


CGC 


CCC 


ACT 


AGC GAC 


TGC 


GCC 


GTC 


GAC 


GAT 


CCG 


GCA 


GGG 




cys ala 


trp 


arg 


leu 


gly 


cys 


arg 


pro 


thr 


ser asp 


cys 


ala 


val 


asp 


asp 


pro 


ala 


gly 




181/61 


















211/71 




















CAC GTC 


AGC 


GCG 


AAA 


CGT 


CAC 


CGC 


GTT 


CGG 


GGC ACC 


AGA 


GCA 


CGG 


ACC 


CGG 


CAT 


GCC 


GCG 




his val 


ser 


ala 


lys 


arg 


his 


arg 


val 


arg 


gly thr 


arg 


ala 


arg 


thr 


arg 


his 


ala 


ala 




241/81 


















271/91 




















GGT CTC 


GGA 


CGG 


TCG 


GGC 


GCC 


ATC 


GAC 


GCC 


GGA CGA 


GGT 


CGC 


GGT 


GTC 


GAG 


CAC 


GCT 


GGG 




gly leu 


gly 


arg 


ser 


gly 


ala 


ile 


asp 


ala 


gly arg 


gly 


arg 


gly 


val 


glu 


his 


ala 


gly 



301/101 

CCG AAA CCT CGG CGA CGA TC 
pro lys pro arg arg arg} 



SEQ ID KcGl/S?-'59 
FIGURE 9A 



as it>ao ,«> gag gcc 


















32/11 


















GAG 


CGA 


ACC 


GTC 


GAT 


TCC 


ATG 


GGT 


GTC GAC 


GCA 


TTC 


GTG 


GTC 


AAG 


GCC 


GGC 


GCG 


^ ib Kd>- (glu ala 


glu 


arg 


thr 


val 


asp 


ser 


met 


gly 


val asp 


ala 


phe 


val 


val 


lys 


ala 


gly 


ala 


62/21 


















92/31 


















GCA GGA 


CCG 


TTC 


CTG 


GGT 


TCG 


ACA 


CCA 


TTC 


GCC CAA 


ATC 


GAC 


CTG 


CCC 


GAG 


GTT 


GCT 


CGT 


ala gly 


pro 


phe 


leu 


gly 


ser 


thr 


pro 


phe 


ala gin 


ile 


asp 


leu 


pro 


gin 


val 


ala 


arg 


122/41 


















152/51 


















GCG CCT 


GGC 


GTC 


TTG 


GCT 


GCC 


GCC 


CCA 


CTA 


GCG ACT 


GCG 


CCG 


TCG 


ACG 


ATC 


CGG 


CAG 


GGC 


ala pro 


gly 


val 


leu 


ala 


ala 


ala 


pro 


leu 


ala thr 


ala 


pro 


ser 


thr 


ile 


arg 


gin 


gly 


182/61 


















212/71 


















ACG TCA 


GCG 


CGA 


AAC 


GTC 


ACC 


GCG 


TTC 


GGG 


GCA CCA 


GAG 


CAC 


GGA 


CCC 


GGC 


ATG 


CCG 


CGG 


thr ser 


ala 


arg 


asn 


val 


thr 


ala 


phe 


gly 


ala pro 


glu 


his 


gly 


pro 


gly 


met 


pro 


arg 


242/81 


















272/91 
















GTC TCG 


GAC 


GGT 


CGG 


GCG 


CCA 


TCG 


ACG 


CCG 


GAC GAG 


GTC 


GCG 


GTG 


TCG 


AGC 


ACG 


CTG 


GGC 


val ser 


asp 


gly 


arg 


ala 


pro 


ser 


thr 


pro 


asp glu 


val 


ala 


val 


ser 


ser 


thr 


leu 


gly 


302/101 




































CGA AAC 


CTC 


GGC 


GAC 


GAT 


C 


























arg asn 


leu 


gly 


asp 


asp) 





























SEQ ID No5.'&0-/6?f 
FIGURE 9B 



30/185 



33/11 



tQ> VD Wo./to^GG CCG 


AGC 


GAA 


CCG 


TCG 


ATT 


CCA 


TGG 


GTG TCG ACG 


CAT 


TCG 


TGG 


TCA 


AGG 


CCG 


GCG 


CGG 




ser 


glu 


pro 


ser 


ile 


pro 


trp 


val ser thr 
93/31 


his 


ser 


trp 


ser 


ara 


oro 


ala 


arg 


CAG GAC 


CGT 


TCC 


TGG 


GTT 


CGA 


CAC 


CAT 


TCG CCC AAA 


TCG 


ACC 


TGC 


CCC 


AGG 


TTG 


CTC 


GTG 


gin asp 
123/41 


arg 


ser 


trp 


val 


arg 


his 


his 


ser pro lys 
153/51 


ser 


thr 


cys 


pro 


arg 


leu 


leu 


val 


CGC CTG 


GCG 


TCT 


TGG 


CTG 


CCG 


CCC 


CAC 


TAG CGA CTG 


CGC 


CGT 


CGA 


CGA 


TCC 


GGC 


AGG 


GCA 


arg leu 
183/61 
CGT CAG 


ala 
CGC 


ser 
GAA 


trp 
ACG 


leu 
TCA 


pro 
CCG 


pro his) AMB (arg leu 

l&Wo./fct/ 213/71 
CGT TCG GGG CAC CAG 


arg 
AGC 


arg 
ACG 


arg 
GAC 


arg 
CCG 


ser 
GCA 


gly 
TGC 


arg 
CGC 


ala 
GGG 


arg gin 
243/81 


arg 


glu 


thr 


ser 


pro 


arg 


ser 


gly his gin 
273/91 


ser 


thr 


asp 


pro 


ala 


cys 


arg 


gly 


TCT CGG 


ACG 


GTC 


GGG 


CGC 


CAT 


CGA 


CGC 


CGG ACG AGG 


TCG 


CGG 


TGT 


CGA 


GCA 


CGC 


TGG 


GCC 


ser arg 
303/101 


thr 


val 


gly 


arg 


his 


arg 


arg 


arg thr arg 


ser 


arg 


cys 


arg 


ala 


arg 


trp 


ala 


GAA ACC 


TCG 


GCG 


ACG 


ATC 


























glu thr. 


, ser 


ala 


thr 


ile) 



























SEQ ID Kcfr. /Co2-/<*Y 
FIGURE 9C 



^TTA ACG 


















31/11 


















ACT 


CAG 


ACG 


GAA 


ACG 


CTT 


GAA 


CCG 


CGA GGT 


CGC 


TCC 


GGA 


CAC 


CAA 


TTT 


GAC 


TCG 


_s^(Leu thr 

C(5v ^i^r 61/21 


thr 


gin 


thr 


glu 


thr 


leu 


glu 


pro 


arg gly 


arg 


ser 


gly 


his 


gin 


phe 


asp 


ser 


















91/31 








\U>U> GCT CTT 


TGG 


CAA 


TTG 


AAG 


GTG 


AGC 


TGC 


GAG 


CAG CCG 


GGT 


GAC 


CGC 


ATC 


GTT 


GGC 


CTT 


GCC 


ala leu 


trp 


gin 


leu 


lys 


val 


ser 


cys 


glu 


gin pro 


gly 


asp 


arg 


ile 


val 


gly 


leu 


ala 


121/41 


















151/51 


















ATC AAT 


CGC 


CGG 


CTC 


GCG 


GAC 


GTA 


GAT 


AAT 


CAG CTC 


ACC 


GTT 


GGG 


ACC 


GAC 


CTC 


GAC 


CAG 


ile asn 


arg 


arg 


leu 


ala 


asp 


val 


asp 


asn 


gin leu 


thr 


val 


gly 


thr 


asp 


leu 


asp 


gin 


181/61 


















211/71 


















GGG TCC 


TTT 


GTG 


ACT 


GCC 


GGG 


CTT 


GAC 


GCG 


GAC GAC 


CAC 


AGA 


GTC 


GGT 


CAT 


CGC 


CTA 


AGG 


gly ser 


phe 


val 


thr 


ala 


gly 


leu 


asp 


ala 


asp asp 


his 


arg 


val 


gly 


his 


arg 


leu 


arg 


241/81 


















271/91 


















CTA CCG 


TTC 


TGA 


CCT 


GGG 


GCT 


GCG 


TGG 


GCG 


CCG ACG 


ACG 


TGA 


GGC 


ACG 


TCA 


TGT 


CTC 


AGC 


leu pro 


phe) OPAYprp 


gly 


ala 


ala 


trp 


ala 


pro thr 


thr) 


OPA 




thr 


ser 


cys 


leu 


ser 


301/101 












331/111 












GGC CCA 


CCG 


CCA 


CCT 


CGG 


TCG 


CCG 


GCA 


GTA 


TGT CAG 


CAT 


GTG 


CAG 


ATG 


ACT 


CCA 


CGC 


AGC 


gly pro 


pro 


pro 


pro 


arg 


ser 


pro 


ala 


val 


cys gin 


his 


val 


gin 


met 


thr 


pro 


arg 


ser 


361/121 


















391/131 


















CTT GTT 


CGC 


ATC 


GTT 


GGT 


GTC 


GTG 


GTT 


GCG 


ACG ACC 


TTG 


GCG 


CTG 


GTG 


AGC 


GCA 


CCC 


GCC 


leu val 


arg 


ile 


val 


gly 


val 


val 


val 


ala 


thr thr 


leu 


ala 


leu 


val 


ser 


ala 


pro 


ala 


421/141 




































GGC GGT 


CGT 


GCC 


GCG 


CAT 


GCG 


GAT 


C 






















gly gly 


arg 


ala 


ala 


his 


ala 


asp) 

























SEQ ID K©s./<fe6-/<e£ 



FIGURE 10A 



31/185 



3 2 / X 1 

££> »I> V^O. '^S^TAA CGA CTC AGA CGG AAA CGC TTG AAC CGC GAG GTC GCT CCG GAC ACC AAT TTG ACT CGG 

— QQHp(arq leu arg arg lys arg leu asn arg glu val ala pro asp thr asn leu thr arg 
£>HuD.lTO 62/21 92/31 

CTC TTT GGC AAT TGA AGG TGA GCT GCG AGC AGC CGG GTG ACC GCA TCG TTG GCC TTG CCA 

leu phe gly asn) OPA arg OPAjfala ala ser ser arg val thr ala ser leu ala leu pro 
122/41 SE/X> »D IU>. 1 11 152/51 

TCA ATC GCC GGC TCG CGG ACG TAG ATA ATC AGC TCA CCG TTG GGA CCG ACC TCG ACC AGG 

ser ile ala gly ser arg thr) AMBj(ile ile ser ser pro leu gly pro thr ser thr arg 
182/61 ^>tO 5> Ho. \7Z 212/71 

GGT CCT TTG TGA CTG CCG GGC TTG ACG CGG ACG ACC ACA GAG TCG GTC ATC GCC TAA GGC 

gly pro leu) OPA^leu pro gly leu thr arg thr thr thr glu ser val ile ala)oCH/gly 
242/81 j0 272/91 Sf ^ ^ y 

TAC CGT TCT GAC CTG GGG CTG CGT GGG CGC CGA CGA CGT GAG GCA CGT CAT GTC TCA GCG 

tyr arg ser asp leu gly leu arg gly arg arg arg arg glu ala arg his val ser ala 
302/101 332/111 

GCC CAC CGC GAC CTC GGT CGC CGG CAG TAT GTC AGC ATG TGC AGA TGA CTC CAC GCA GCC 

ala his arg his leu gly arg arg gin tyr val ser met cys arg) OPA (leu his ala ala 
362/121 392/131 it/Vtfx M5 

TTG TTC GCA TCG TTG GTG TCG TGG TTG CGA CGA CCT TGG CGC TGG TGA GCG CAC CCG CCG 

leu phe ala ser leu val ser trp leu arg arg pro trp arg trp) OPA ^la his pro pro 

422/141 -Va^TM! i to* 

GCG GTC GTG CCG CGC ATG CGG ATC *>C<» LP Htf.KW. 

ala val val pro arg met arg Ile) 



SEQ ID No*.) TO -17(0 
FIGURE 10B 

33/11 



!t>i^c. IT?** AAC GAC 


TCA 


GAC 


GGA 


AAC 


GCT TGA 


ACC 


GCG 


AGG TCG 


CTC 


CGG 


ACA 


CCA 


ATT 


TGA 


CTC 


GGC 


^asn asp 


ser 


asp 


gly 


asn 


ala) OPA_(thr 


ala 


arg ser 


leu 


arg 


thr 


pro 


ile) 


OPA 


deu 


gly 


>£T<S? /DKo./W63/21 












93/31 












TCT TTG 


GCA 


ATT 


GAA 


GGT 


GAG CTG 


CGA 


GCA 


GCC GGG 


TGA 


CCG 


CAT 


CGT 


TGG 


CCT 


TGC 


CAT 


ser leu 


ala 


ile 


glu 


gly 


glu leu 


arg 


ala 


ala gly} 


OPAfrro 


his 


arg 


trp 


pro 


cys 


his 


123/41 
















153/51 




J -5/ 










CAA TCG 


CCG 


GCT 


CGC 


GGA 


CGT AGA 


TAA 


TCA 


GCT CAC 


CGT 


TGG 


GAC 


CGA 


CCT 


CGA 


CCA 


GGG 


gin ser 


pro 


ala 


arg 


gly 


arg arg) OCH (ser 


ala his 


arg 


trp 


asp 


arg 


pro 


arg 


pro 


gly 


183/61 












213/71 


















GTC CTT 


TGT 


GAC 


TGC 


CGG 


GCT TGA 


CGC 


GGA 


CGA CCA 


CAG 


AGT 


CGG 


TCA 


TCG 


CCT 


AAG 


GCT 


val leu 


cys 


asp 


cys 


arg 


ala) OPA prg 


gly 


arg pro 


gin 


ser 


arg 


ser 


ser 


pro 


lys 


ala 


243/81 












273/91' 


















ACC GTT 


CTG 


ACC 


TGG 


GGC 


TGC GTG 


GGC 


GCC 


GAC GAC 


GTG 


AGG 


CAC 


GTC 


ATG 


TCT 


CAG 


CGG 


thr val 


leu 


thr 


trp 


gly 


cys val 


gly 


ala 


asp asp 


val 


arg 


his 


val 


met 


ser 


gin 


arg 


303/101 
















333/111 


















CCC ACC 


GCC 


ACC 


TCG 


GTC 


GCC GGC 


AGT 


ATG 


TCA GCA 


TGT 


GCA 


GAT 


GAC 


TCC 


ACG 


CAG 


CCT 


pro thr 


ala 


thr 


ser 


val 


ala gly 


ser 


met 


ser ala 


cys 


ala 


asp 


asp 


ser 


thr 


gin 


pro 


363/121 
















393/131 


















TGT TCG 


CAT 


CGT 


TGG 


TGT 


CGT GGT 


TGC 


GAC 


GAC CTT 


GGC 


GCT 


GGT 


GAG 


CGC 


ACC 


CGC 


CGG 


cys ser 


his 


arg 


trp 


cys 


arg gly 


cys 


asp 


asp leu 


gly 


ala 


gly 


glu 


arg 


thr 


arg 


arg 


423/141 


































CGG TCG 


TGC 


CGC 


GCA 


TGC 


GGA TC 
























arg ser 


cys 


arg 


ala 


cys 


giy) 

























SEQ ID No©./77-/*3 



FIGURE IOC 



32/185 



iG >D*£>- ccc GAA GAG GTC CCC CGT TTT GTT AAT TTT TAA AAA ATT TGT GTC ACA AAC CGG GGT ACC 

s> ir> *u ,o>*(P ro glu glu val pro arg P he val as P phe) OCK.(lys ile cys val thr lys arg gly thr 
^•'* 5 61/21 t>c$ ID >4o./8^T731 

AAG GCA TAA AAC CTA GTA CCT GGG GCG GCG GAT TCA ACG AAA ACC GAG TGG GGG TAG TCA 
lys ala) oCH^ksn leu val pro gly ala ala asp ser thr lys thr glu trp gly)AMB/ser 
121/41 Sfc^/^Kc. /ST 151/51 Se& to tk>~ i?^^ 

GGG GCG TGC ATT CCG ACG ACC CTG TAC GAC CCG CTG GTG GCA ACG CCG ATG AGT GCG CCG 
gly ala cys ile pro thr thr leu tyr asp pro leu val ala thr pro met ser ala pro 
181/61 211/71 

ACG AAG GCC GAG CGA CGG GCT GCC GGC GCT GAC CGC CGC GGA AGC CGC CGA GTG CAT GGT 
thr lys ala glu arg arg ala ala gly ala asp arg arg gly ser arg arg val asp gly 
• 241/81 271/91 

CAC CAC CGC CCG CAC CCG ACC GGT ACG GAT CGC GCC TCG GGT TAC CGT CGC CGT CAA CGC 
his his arg pro his pro thr gly thr asp arg ala ser gly tyr arg arg arg gin arg 
301/101 331/111 

GCT GGA CAG CAT CGG TCC CCG CTG GGT CAA TGC ACT CAT GCA GCG CCG CAA CGA ACA GCT 
ala gly gin his arg ser pro leu gly gin cys thr his ala ala pro gin arg thr ala 
361/121 391/131 

CAA CCC TTG AAC CGG GTC CCG GCC TGC CGA CCC TCG GCC GCC GGC GTG CCG CTA CGT GAT 
gin pro leu asn arg val pro ala cys arg pro ser ala ala gly val pro leu arg asp 
421/141 451/151 

AGA CAC AGG GCC ATG GAA ATC CTG GCC AGC CGG ATG CTA CTT CGG CCG GCG GAC TAT CAG 
arg his arg ala met glu ile leu ala ser arg met leu leu arg pro ala asp tyr gin 
481/161 

CGG TCG CTG AGC TTC TAC CGT GAC CAG ATC 
arg ser leu ser phe tyr arg asp gin ile) 

SEQ ID *03-W-/8S 
FIGURE 11A 



/B9-5-CCG AAG 


















32/11 


















AGG 


TCC 


CCC 


GTT 


TTG 


TTA 


ATT 


TTT 


AAA AAA 


TTT 


GTG 


TCA 


CAA 


AGC 


GGG 


GTA 


CCA 


(pro lys 
62/21 


arg 


ser 


pro 


val 


leu 


leu 


ile 


phe 


lys lys 


phe 


val 


ser 


gin 


ser 


gly 


val 


pro 


















92/31 


















AGG CAT 


AAA 


ACC 


TAG 


TAC 


CTG 


GGG 


CGG 


CGG 


ATT CAA 


CGA 


AAA 


CCG 


AGT 


GGG 


GGT 


AGT 


CAG 


arg his 


lys 


thr) 


AMB 


(tyr 


leu 


gly 


arg 


arg 


ile gin 


arg 


lys 


pro 


ser 


gly 


gly 


ser 


gin 


122/41 






eg 


S ;D fro* ft* 






152/51 


















GGG CGT 


GCA 


TTC 


CGA 


CGA 


CCC 


TGT 


ACG 


ACC 


CGC TGG 


TGG 


CAA 


CGC 


CGA 


TGA 


GTG 


CCC 


CGA 


gly arg 


ala 


phe 


arg 


arg 


pro 


cys 


thr 


thr 


arg trp 


trp 


gin 


arg 


arg) 


lOPAj 


(val 


arg 


arg 


182/61 


















212/71 












CGA AGG 


ccG 


AGC 


GAC 


GGG 


CTG 


CCG 


GCG 


CTG 


ACC GCC 


GCG 


GAA 


GCC 


GCC 


GAG 


TGG 


ATG 


GTC 


arg arg 


pro 


ser 


asp 


gly 


leu 


pro 


ala 


leu 


thr ala 


ala 


glu 


ala 


ala 


glu 


trp 


met 


val 


242/81 


















272/91 


















ACC ACC 


GCC 


CGC 


ACC 


CGA 


CCG 


GTA 


CGG 


ATC 


GCG CCT 


CGG 


GTT 


ACC 


GTC 


GCC 


GTC 


ACC 


GCG 


thr thr 


ala 


arg 


thr 


arg 


pro 


val 


arg 


ile 


ala pro 


arg 


val 


thr 


val 


ala 


val 


asn 


ala 


302/101 


















332/111 


















CTG GAC 


AGC 


ATC 


GGT 


CCC 


CGC 


TGG 


GTC 


AAT 


GCA CTC 


ATG 


CAG 


CGC 


CGC 


AAC 


GAA 


CAG 


CTC 


leu asp 


ser 


ile 


gly 


pro 


arg 


trp 


val 


asn 


ala leu 


met 


gin 


arg 


arg 


asn 


glu 


gin 


leu 


362/121 


















392/131 


















AAC CCT 


TGA 


ACC 


GGG 


TCC 


CGG 


CCT 


GCC 


GAC 


CCT CGG 


CCG 


CCG 


GCG 


TGC 


CGC 


TAC 


GTG 


ATA 


asn pro) 


1 OPA (thr 


gly 


ser 


arg 


pro 


ala 


asp 


pro arg 


pro 


pro 


ala 


cys 


arg 


tyr 


val 


ile 


422/141 












452/151 


















GAC ACA 


GGG 


CCA 


TGG 


AAA 


TCC 


TGG 


CCA 


GCC 


GGA TGC 


TAC 


TTC 


GCC 


CGG 


CGG 


ACT 


ATC 


AGC 


asp thr 


gly 


pro 


trp 


lys 


ser 


trp 


pro 


ala 


gly cys 


tyr 


phe 


gly 


arg 


arg 


thr 


ile 


ser 



482/161 

GGT CGC TGA GCT TCT ACC GTG ACC AGA TC 
gly arg)oPA^ala ser thr val thr argj? 
?>6<S> ID Ho- /<?<-/ 



SEQ ID A/cs.i*q-fW 



FIGURE 11B 



33/185 



33/11 

lL>^Cw?3^ CGA AGA GGT CCC CCG TTT TGT TAA TTT TTA AAA AAT TTG TGT CAC AAA GCG GGG TAC CAA 
/t)t^o /<^b-^( ar 9 arg gly pro pro phe cys ) oc Hy(p he leu l vs asn leu cvs nis l vs ala gly tyr gin 
63/21 3£&/&K©i97 93 / 31 

GGC ATA AAA CCT AGT ACC TGG GGC GGC GGA TTC AAC GAA AAC CGA GTG GGG GTA GTC AGG 
gly ile lys pro ser thr trp gly gly gly phe asn glu asn arg val gly val val arg 
123/41 153/51 

GGC GTG CAT TCC GAC GAC CCT GTA CGA CCC GCT GGT GGC AAC GCC GAT GAG TGC GCC GAC 
gly val his ser asp asp pro val arg pro ala gly gly asn ala asp glu cys ala asp 
183/61 213/71 

GAA GGC CGA GCG ACG GGC TGC CGG CGC TGA CCG CCG CGG AAG CCG CCG AGT GGA TGG TCA 
glu gly arg ala thr gly cys arg arg)OPA(pro pro arg lys pro pro ser gly trp ser 
243/81 Se<3> |D Ho. (Q^^ 273/91 

CCA CCG CCC GCA CCC GAC CGG TAC GGA TCG CGC CTC GGG TTA CCG TCG CCG TCA ACG CGC 
pro pro pro ala pro asp arg tyr gly ser arg leu gly leu pro ser pro ser thr arg 
303/101 333/111 

TGG ACA GCA TCG GTC CCC GCT GGG TCA ATG CAC TCA TGC AGC GCC GCA ACG AAC AGC TCA 
trp thr ala ser val pro ala gly ser met his ser cys ser ala ala thr asn ser ser 
363/121 393/131 

ACC CTT GAA CCG GGT CCC GGC CTG CCG ACC CTC GGC CGC CGG CGT GCC GCT ACG TGA TAG 
thr leu glu pro gly pro gly leu pro thr leu gly arg arg arg ala ala thr)oPA AMB 
423/141 453/151 

ACA CAG GGC CAT GGA AAT CCT GGC CAG CCG GAT GCT ACT TCG GCC GGC GGA CTA TCA GCG 
|TW (QQ ( thr gln gly his gly aSn pr ° gly gln pr ° aSp ala thr Ser ala gly gly leU Ser ala 

GTC GCT GAG CTT CTA CCG TGA CCA GAT C 
val ala glu leu leu pro) OPA pro asp 



SEQ ID (<o5. '96- /Q9 



FIGURE 11C 



part of the nucleotide sequence of Seqll 

1/1 31/11 



>£& iD Ko* 'ZCD-xzgt CGC 


CGT 


CAA 


CGC 


GCT 


GGA 


CAG 


CAT 


CGG 


TCC CCG 


CTG 


GGT 


CAA 


TGC 


ACT 


CAT 


GCA 


GCG 


^arg arg 
CCG CAA 


arg 


gin 


arg 


ala 


gly 


gin 


his 


arg 


ser pro 


leu 


gly 


gin 


cys 


thr 


his 


ala 


ala 


















91/31 


















CGA 


ACA 


GCT 


CAA 


CCC 


TTG 


AAC 


CGG 


GTC CCG 


GCC 


TGC 


CGA 


CCC 


TCG 


GCC 


GCC 


GGC 


pro gin 


arg 


thr 


ala 


gin 


pro 


leu 


asn 


arg 


val pro 


ala 


cys 


arg 


pro 


ser 


ala 


ala 


gly 


121/41 


















151/51 


















GTG CCG 


CTA 


CGT 


GAT 


AGA 


CAC 


AGG 


GCC 


ATG 


GAA ATC 


CTG 


GCC 


AGC 


CGG 


ATG 


CTA 


CTT 


CGG 


val pro 


leu 


arg 


asp 


arg 


his 


arg 


ala 


met 


glu ile 


leu 


ala 


ser 


arg 


met 


leu 


leu 


arg 


181/61 


















211/71 


















CCG GCG 


GAC 


TAT 


CAG 


CGG 


TCG 


CTG 


AGC 


TTC 


TAC CGT 


GAC 


CAG 


ATC 












pro ala 


asp 


tyr 


gin 


arg 


ser 


leu 


ser 


phe 


tyr arg 


asp 


gin 


ile] 


) 











SEQ ID £los gjoo-zo* 
FIGURE 11A 1 



34/185 



1/1 31/11 



f/al ala val asn ala leu asp ser ile gly pro arg 

91/31 

^^°ZCGC AAC GAA CAG CTC AAC CCT TGA ACC GGG TCC CGG 



121/41 ST® it> Ki- 2.CH 151/51 

TGC CGC TAC GTG ATA GAC ACA GGG CCA TGG AAA TCC 
cys arg tyr val ile asp thr gly pro trp lys ser 
181/61 211/71 
CGG CGG ACT ATC AGC GGT CGC TGA GCT TCT ACC GTG 
arg arg thr ile ser gly argJOPA(^ala ser thr val 

St to j£> Ko. 2Q5> 



TGG 


GTC 


AAT 


GCA 


CTC 


ATG 


CAG 


CGC 


trp 


val 


asn 


ala 


leu 


met 


gin 


arg 


CCT 


GCC 


GAC 


CCT 


CGG 


CCG 


CCG 


GCG 


pro 


ala 


asp 


pro 


arg 


pro 


pro 


ala 


TGG 


CCA 


GCC 


GGA 


TGC 


TAC 


TTC 


GGC 


trp 


pro 


ala 


gly 


cys 


tyr 


phe 


gly 


ACC 


AGA 


TC 












thr 


arg) 















SEQ ID :& 0Si <Z&2-205 
FIGURE 11B' 











31/11 
















ACA 


GCA 


TCG 


GTC 


CCC GCT GGG 


TCA 


ATG 


CAC 


TCA 


TGC 


AGC 


GCC 


thr 


ala 


ser 


val 


pro ala gly 
91/31 


ser 


met 


his 


ser 


cys 


ser 


ala 


CTT 


GAA 


CCG 


GGT 


CCC GGC CTG 


CCG 


ACC 


CTC 


GGC 


CGC 


CGG 


CGT 


leu 


glu 


pro 


gly 


pro gly leu 
151/51 


pro 


thr 


leu 


gly 


arg 


arg 


arg 


CAG 


GGC 


CAT 


GGA 


AAT CCT GGC 


CAG 


CCG 


GAT 


GCT 


ACT 


TCG 


GCC 


gin 


gly 


his 


gly 


asn pro gly 
211/71 


gin 


pro 


asp 


ala 


thr 


ser 


ala 


GCT 


GAG 


CTT 


CTA 


CCG TGA CCA 


GAT 


C 












ala 


glu 


leu 


leu 


pro) OPA pro 


asp 















1/1 

i 6: fD 'Ao. f£fc>yrcG ccg tca acg cgc tgg 

^(ser pro ser thr arg trp 

A.o 201 61/21 

GCA ACG AAC AGC TCA ACC 

ala thr asn ser ser thr 

121/41 

dCC GCT ACG TGA TAG ACA 
ala ala thr)oi>Aj(AMB thr 
181/61 ' <Se^'tO 2^® 
GGC GGA CTA TCA GCG GTC 



SEQ ID Kerb, £6c^- 2&& 
FIGURE 11C 

sequence Rv054 6c predicted by Cole et al . (Nature 393:537-544) and containing SeqllA 1 

1/1 31/11 

*-<Y Lb \Ao- 2CQ->atq gaa ate ctg gec age egg atg eta ctt egg ccg gcg gac tat cag egg teg ctg age 

^Vlet glu ile leu ala ser arg met leu leu arg pro ala asp tyr gin arg ser leu ser 
*'0 61/21 91/31 

ttc tac cgt gac cag ate ggg ctg gcg att gec cgt gaa tac ggg gec ggc aca gtg ttt 

phe tyr arg asp gin ile gly leu ala ile ala arg glu tyr gly ala gly thr val phe 

121/41 151/51 

ttc gec ggt cag tca ctg etc gaa ctg gee ggt tac ggc gag ccg gac cat teg egg gga 

phe ala gly gin ser leu leu glu leu ala gly tyr gly glu pro asp his ser arg gly 

181/61 211/71 

cct ttt ccc ggc gcg ctg tgg ctg cag gtg cgc gac etc gag get acc cag acc gag ctg 

pro phe pro gly ala leu trp leu gin val arg asp leu glu ala thr gin thr glu leu 

241/81 271/91 

gtc age cga ggc gtg teg ate get cgc gag ccc cgc cgc gaa ccg tgg ggc ctg cac gag 

val ser arg gly val ser ile ala arg glu pro arg arg glu pro trp gly leu his glu 

301/101 331/111 

atg cat gtg acc gac cca gac ggg ate aca ctg ata ttc gtc gag gtt ccc gag ggt cac 

met his val thr asp pro asp gly ile thr leu ile phe val glu val pro glu gly his 
361/121 

ccg ctg cgt aca gac acc egg gcg tga 
pro leu arg thr asp thr arg ala) OPA 

SEQ ID lfa& m Ot>Q-ZlO 



FIGURE 11D 



35/185 



ORF predicted by Cole et al . (Nature 393:537-544) and containing Rv0546c 



1/1 


















31/11 


















<3£c£ \"D Ho. tag tea 


ggg 


cgt 


gca 


ttc 


gac 


gac 


get 


gta 


eta ccc 


get 


ggt 


ggc 


aac 


tec 


gat 


gat 


tgc 


AMB £er 


gly 


arg 


ala 


phe 


asp 


asp 


ala 


val 


leu pro 


ala 


gly 


gly 


asn 


ser 


asp 


asp 


cys 


61/21 


















91/31 












gec gac 


gaa 


ggc 


eta 


cga 


egg 


get 


gee 


ggc 


get gac 


cgc 


cgc 


gga 


age 


cgc 


cga 


gtg 


gat 


ala asp 


glu 


gly 


leu 


arg 


arg 


ala 


ala 


gly 


ala asp 


arg 


arg 


gly 


ser 


arg 


arg 


val 


asp 


121/41 


















151/51 
















ggt cac 


cgc 


cgc 


ccg 


cac 


ccg 


acc 


ggt 


gcg 


gat cgc 


gec 


teg 


ggt 


tgc 


cgt 


cgc 


cgt 


caa 


gly his 


arg 


arg 


pro 


his 


pro 


thr 


gly 


ala 


asp arg 


ala 


ser 


gly 


cys 


arg 


arg 


arg 


gin 


181/61 


















211/71 














cgc get 


gga 


cag 


cat 


egg 


tec 


ccg 


ctg 


ggt 


caa tgc 


act 


cat 


gca 


gcg 


ccg 


caa 


cga 


aca 


arg ala 


gly 


gin 


his 


arg 


ser 


pro 


leu 


gly 


gin cys 


thr 


his 


ala 


ala 


pro 


gin 


arg 


thr 


241/81 


















271/91 


















get caa 


ccc 


ttg 


aac 


egg 


gtc 


ccg 


gee 


tgc 


cga ccc 


teg 


gee 


gee 


ggc 


gtg 


ccg 


eta 


' cgt 


ala gin 


pro 


leu 


asn 


arg 


val 


pro 


ala 


cys 


arg pro 


ser 


ala 


ala 


gly 


val 


pro 


leu 


a rg 


301/101 


















331/111 
















gat aga 


cac 


agg 


gee 


atg 


gaa 


ate 


ctg 


gec 


age egg 


atg 


eta 


ctt 


egg 


ccg 


gcg 


gac 


tat 


asp arg 


his 


arg 


ala 


met 


glu 


ile 


leu 


ala 


ser arg 


met 


leu 


leu 


arg 


pro 


ala 


asp 


tyr 


361/121 


















391/131 


















cag egg 


teg 


ctg 


age 


ttc 


tac 


cgt 


gac 


cag 


ate ggg 


ctg 


gcg 


att 


gec 


cgt 


gaa 


tac 


ggg 


gin arg 


ser 


leu 


ser 


phe 


tyr 


arg 


asp 


gin 


ile gly 


leu 


ala 


ile 


ala 


arg 


glu 


tyr 


gly 


421/141 


















451/151 


















gee ggc 


aca 


gtg 


ttt 


ttc 


gee 


ggt 


cag 


tea 


ctg etc 


gaa 


ctg 


gec 


ggt- 


• tac 


gge- gag 1 ccg 


ala gly 


thr 


val 


phe 


phe 


ala 


gly 


gin 


ser 


leu leu 


glu 


leu 


ala 


gly 


tyr 


gly 


glu 


pro 


481/161 


















511/171 


















gac cat 


teg 


egg 


gga 


cct 


ttt 


ccc 


ggc 


gcg 


ctg tgg 


ctg 


cag 


gtg 


cgc 


gac 


etc 


gag 


get 


asp his 


ser 


arg 


gly 


pro 


phe 


pro 


gly 


ala 


leu trp 


leu 


gin 


val 


arg 


asp 


leu 


glu 


ala 


541/181 


















571/191 


















acc cag 


acc 


gag 


ctg 


gtc 


age 


cga 


ggc 


gtg 


teg ate 


get 


cgc 


gag 


ccc 


cgc 


cgc 


gaa 


ccg 


thr gin 


thr 


glu 


leu 


val 


ser 


arg 


gly 


val 


ser ile 


ala 


arg 


glu 


pro 


arg 


arg 


glu 


pro 


601/201 


















631/211 


















tgg ggc 


ctg 


cac 


gag 


atg 


cat 


gtg 


acc 


gac 


cca gac 


ggg 


ate 


aca 


ctg 


ata 


ttc 


gtc 


gag 


trp gly 


leu 


his 


glu 


met 


his 


val 


thr 


asp 


pro asp 


gly 


ile 


thr 


leu 


ile 


phe 


val 


glu 


661/221 


















691/231 


















gtt ccc 


gag 


ggt 


cac 


ccg 


ctg 


cgt 


aca 


gac 


acc egg 


gcg^ 


tga 














val pro 


glu 


gly 


his 


pro 


leu 


arg 


thr 


asp 


thr arg 


ala 


)OPA 















SEQ ID lUz>> £.H~&I2- 
FIGURE HE 



36/185 



1/1 

1/1 


















31/11 




















gat 


ttc 


gcg 






teg 


gcc 


tgt aag gca 


acg 


cga 


ggt 


ctt 


cat 


gcc 


gag 


Jpsp arg 
_ ' 61/21 


arg 


asp 


phe 


a .La 


-t- l» *- 

tnr 


asn 


ser 


ala 


cys lys ala 


thr 


arg 


gly 


leu 


his 


ala 


glu 


















91/31 
































tga 
















gac gta 


gac 


agg 


aag 


aga 


cag 


gga 


age 


tga cgt cgc 


gta 


ccg 


gac 


cgc 


cat 


tct 


gtc 


asp va 1 


asp 


arg 


lys 


arg 


gin 


gly 


ser] 




OPA (arg arg 


val 


pro 


asp 


arg 


his 


ser 


val 


121/41 


















151/51%£Cv ibuvo. 








gag tct 


ttc 


cga 


gtt 


cag 


caa 


caa 


teg 


aca 


cag aag egg 


gga 


cca 


gac 


egg 


gag 


gac 


gac 


glu ser 


phe 


arg 


val 


gin 


gin 


gin 


ser 


thr 


gin lys arg 


gly 


pro 


asp 


arg 


glu 


asp 


asp 


• 181/61 


















211/71 
















gcg gcc 


egg 


gcc 


get 


teg 


ggc 


cga 


gtg 


tct 


gag taa gac 


cag 


agt 


cac 


ggg 


tec 


gtg 


tgt 


ala ala 


arg 


ala 


ala 


ser 


gly 


arg 


val 


ser 


gluj OCH (asp 


gin 


ser 


his 


gly 


ser 


val 


cys 


241/81 


















271/91 gg© 












gac aac 


cgc 


gcg 


gaa 


ttc 


aat 


egg 


atg 


gcg 


ggc ggg acc 


gga 


ttg 


cgc 


egg 


tea 


ccg 


agg 


asp asn 


arg 


ala 


glu 


phe 


asn 


arg 


met 


ala 


gly gly thr 


gly 


leu 


arg 


arg 


ser 


pro 


arg 


301/101 
































aac etc 


egg 


agt 


gat c 
asp) 


























asn leu 


arg 


ser 



























SEQ ID .Acs> Z\S'2l(a 
FIGURE 12A 



1/1 31/11 



\ v:> iVo2.'7-*icc gaa 


ggg 


att 


teg 


cga 


eta 


act 


egg 


cct 


gta agg 


caa 


cgc 


gag 


gtc 


ttc 


atg 


ccg 


agg 


x ^/thr glu 
it>\*o2i8 61/21 


gly 


ile 


ser 


arg 


leu 


thr 


arg 


pro 


val arg 


gin 


arg 


glu 


val 


phe 


met 


pro 


arg 


















91/31 


















acg tag 


aca 


gga 


aga 


gac 


agg 


gaa 


get 


gat 


gac gtc 


gcg 


tac 


egg 


acc 


gcc 


att 


ctg 


teg 


thr JAMB /thr 


gly 


arg 


asp 


arg 


glu 


ala 


asp 


asp val 


ala 


tyr 


arg 


thr 


ala 


ile 


leu 


ser 


121/41 setf to v^o 


• 2i<? 












151/51 


















agt ctt 


tec 


gag 


ttc 


age 


aac 


aat 


cga 


cac 


aga age 


ggg 


gac 


cag 


acc 


ggg 


agg 


acg 


acg 


ser leu 


ser 


glu 


phe 


ser 


asn 


asn 


arg 


his 


arg ser 


gly 


asp 


gin 


thr 


gly 


arg 


thr 


thr 


181/61 


















211/71 


















egg ccc 


ggg 


ccg 


ctt 


egg 


gcc 


gag 


tgt 


ctg 


agt aag 


acc 


aga 


gtc 


acg 


ggt 


ccg 


tgt 


gtg 


arg pro 


giy 


pro 


leu 


arg 


ala 


glu 


cys 


leu 


ser lys 


thr 


arg 


val 


thr 


gly 


pro 


cys 


val 


241/81 


















271/91 


















aca acc 


gcg 


egg 


aat 


tea 


ate 


gga 


tgg 


egg 


gcg gga 


ccg 


gat 


tgc 


gcc 


ggt 


cac 


cga 


gga 


thr thr 


ala 


arg 


asn 


ser 


ile 


gly 


trp 


arg 


ala gly 


pro 


asp 


cys 


ala 


gly 


his 


arg 


gly 


301/101 




































acc tec 


gga 


gtg 


ate 






























thr ser 


gly 


val 


ile) 































SEQ ID .^.fiM-Z^ 



FIGURE 12B 



37/185 



1/1 31/11 



Jc&> C£> \Ao.2&-?ccq aag 


gga 


ttt 


cgc 


gac 


taa 


etc 


ggc 


ctg 


taa ggc aac 


gcg 


agg 


tct 


tea 


tgc 


cga 


gga 


z^ro lys 


gly 


pne 


arg 


asp) 


OCH 


leu 


gly 


leu 


OCH (jlv asn ala 
91/31 %e G> © Wo 


arg ser 
. 2 22. 


ser 


cy s 


arg 


giy 


cgt aga 


cag 


gaa 


gag 


aca 


ggg 


aag 


ctg 


atg 


acg teg cgt 


ace 


gga 


ccg 


cca 


ttc 


tgt 


cga 


arg arg 


gin 


glu 


glu 


thr 


gly 


lys 


leu 


met 


thr ser arg 


thr 


gly 


pro 


pro 


phe 


cys 


arg 


121/41 


















151/51 
















gtc ttt 


ccg 


agt 


tea 


gca 


aca 


ate 


gac 


aca 


gaa gcg ggg 


ace 


aga 


ccg 


gga 


gga 


cga 


cgc 


val phe 


pro 


ser 


ser 


ala 


thr 


ile 


asp 


thr 


glu ala gly 


thr 


arg 


pro 


gly 


gly 


arg 


arg 


181/61 


















211/71 
















ggc ccg 


ggc 


cgc 


ttc 


ggg 


ccg 


agt 


gtc 


tga 


gta aga cca 


gag 


tea 


egg 


gtc 


cgt 


gtg 


tga 


gly pro 


gly 


arg 


phe 


gly 


pro 


ser 


val) 


OPA 


(val^arg pro 


glu 


ser 


arg 


val 


arg 


val) 


OPA 


241/81 


















271/9TSe^ 












caa ccg 


cgc 


gga 


att 


caa 


teg 


gat 


ggc 


ggg 


egg gac egg 


att 


gcg 


ccg 


gtc 


ace 


gag 


gaa 


/gin pro 


arg 


gly 


ile 


gin 


ser 


asp 


gly 


gly 


arg asp arg 


ile 


ala 


pro 


val 


thr 


glu 


glu 



cct ccg gag tga tc 



pro pro glu OPA) 

SEQ ID ^ ?£C- Z2M 



FIGURE 12C 



1/1 31/11 
lt> KjO. 225*GGG ATT TCG TTG CCC GAT GGA TTG TTT GTA CGG TTT GGG AAA AAC ACT TGA AGT CCT TTT 

u ?9--^7&ly H e ser leu P ro as P 9 lv leu P he val ar 9 P he 9ly lvs asn thr) OPA (ser pro phe 
(S> fc>*6. ZZ«T v i/21 91/31 6£A^^7 

1 ' TAT TGG CAA TGC TGG AAA TGG ACA ( TTC 1 CAA TAT TGC GCG AAT TAA CCG AAC ACG - GTG AGO- 

tyr trp gin cys trp lys trp thr phe gin tyr cys ala asn)0CH^>ro asn thr val arg 
121/41 151/51 €£& &*o-4££ 

GGG GGG CAA GCG TTT GTA CCG GGG CCA GCA AGC GCC GCC GAC CGG TTG ACC GAA GCC AGC 

giy giy gi n a i a P ne vai P ro giy P ro aia ser aia a i a as P ar< ? i€U thr a i a ser 

181/61 211/71 

ATG TTG TTG TGT CAG CGC GGG CTT GGT CTC GAT GTC CCG GCC TTG GCT GGA CCC GCT TCT 
met leu leu cys gin arg gly leu gly leu asp val pro ala leu ala gly pro ala ser 
241/81 271/91 

TCA AAA CAG GTT GAA CTT AAC GAC TCA AGA ACG GAA ACG CTT GAA CCG CGA CGT CGC TCC 
ser lys gin val glu leu asn asp ser arg thr glu thr leu glu pro arg arg arg ser 
301/101 331/111 

GGA CAC CAA TTT GAC TCG GCT CTT TGG CAA TTG AAG GTG AGC TGC GAG CAG CCG GGT GAC 
gly his gin phe asp ser ala leu trp gin leu lys val ser cys glu gin pro gly asp 
361/121 391/131 

CGC ATC GTT GGC CTT GCC ATC AAT CGC CGG CTC GCG GAC GTA GAT AAT CAG CTC ACC GTT 
arg ile val gly leu ala ile asn arg arg leu ala asp val asp asn gin leu thr val 
421/141 451/151 

GGG ACC GAC CTC GAC CAG GGG TCC TTT GTG ACT GCC GGG CTT GAC GCG GAC GAC CAC AGA 
gly thr asp leu asp gin gly ser phe val thr ala gly leu asp ala asp asp his arg 
481/161 511/171 

GTC GGT CAT CGC CTA AGG CTA CCG TTC TGA CCT GGG GCT GCG TGG GCG CCG ACG ACG TGA 
val gly his arg leu arg leu pro phe)oPA | ^ro gly ala ala trp ala pro thr thrJOPA 
541/181 ec<SHDK*>.92V 571/191 

GGC ACG TCA TGT CTC AGC GGC CCA CCG CCA CCT CGG TCG CCG GCA GTA TGT CAG CAT GTG 
jately thr ser cys leu ser gly pro pro pro pro arg ser pro ala val cys gin his val 
to M£>. $2£ 601/201 631/211 

CAG ATG ACT CCA CGC AGC CTT GTT CGC ATC GTT GGT GTC GTG GTT GCG ACG ACC TTG GCG 

gin met thr pro arg ser leu val arg ile val gly val val val ala thr thr leu ala 

661/221 691/231 

CTG GTG AGC GCA CCC GCC GGC GGT CGT GCC GCG CAT GCG GAT C 

leu val ser ala pro ala gly gly arg ala ala his ala asp) 

SEQ ID Xce*. 22t>-2£l, 



FIGURE 13A 



38/185 



32/11 



^> |t>*b. ZZ% ^GGA TTT 


CGT 


TGC 


CCG 


ATG 


GAT 


TGT ' 


TTG 


TAC 


GGT TTG 


GGA 


AAA 


ACA 


CTT 


GAA 


GTC 


CTT 


TTT 


-fely phe 


arg 


cys 


pro 


met 


asp 


cys 


leu 


tyr 


gly leu 


gly 


lys 


thr 


leu 


glu 


val 


leu 


phe 


to 62/21 


















92/31 


















ATT GGC 


AAT 


GCT 


GGA 


AAT 


GGA 


CAT 


TCC 


AAT 


ATT GCG 


CGA 


ATT 


AAC 


CGA 


ACA 


CGG 


TGA 


GGG 


ile gly 


asn 


ala 


gly 


asn 


gly 


his 


ser 


asn 


ile ala 


arg 


ile 


asn 


arg 


thr 


arg\ 


OPA/gly 


122/41 


















152/51 










See* it> 23^ 


GGG GGC 


AAG 


CGT 


TTG 


TAC 


CGG 


GGC 


GAG 


CAA 


GCG CCG 


CCG 


ACC 


GGT 


TGA 


CCG 


AAG 


CCA 


GCA 


gly gly 


lys 


arg 


leu 


tyr 


arg 


gly 


gin 


gin 


ala pro 


pro 


thr 


gly) 


OP A/pro 


lys 


pro 


ala 


182/61 


















212/71 












TGT TGT 


TGT 


GTC 


AGC 


GCG 


GGC 


TTG 


GTC 


TCG 


ATG TCC 


CGG 


CCT 


TGG 


CTG 


GAC 


CCG 


CTT 


CTT 


cys cys 


cys 


val 


ser 


ala 


gly 


leu 


val 


ser 


met ser 


arg 


pro 


trp 


leu 


asp 


pro 


leu 


leu 


242/81 


















272/91 


















CAA AAC 


AGG 


TTG 


AAC 


TTA 


ACG 


ACT 


CAA 


GAA 


CGG AAA 


CGC 


TTG 


AAC 


CGC 


GAC 


GTC 


GCT 


CCG 


gin asn 


arg 


leu 


asn 


leu 


thr 


thr 


gin 


glu 


arg lys 


arg 


leu 


asn 


arg 


asp 


val 


ala 


pro 


302/101 


















332/111 


















GAC ACC 


AAT 


TTG 


ACT 


CGG 


CTC 


TTT 


GGC 


AAT 


TGA AGG 


TGA 


GCT 


GCG 


AGC 


AGC 


CGG 


GTG 


ACC 


asp thr 


asn 


leu 


thr 


arg 


leu 


phe 


gly 


asn] 


|OPA arg 


OPA J^la 


ala 


ser 


ser 


arg 


val 


thr 


362/121 


















392/131««> i£> 2SZ 












GCA TCG 


TTG 


GCC 


TTG 


CCA 


TCA 


ATC 


GCC 


GGC 


TCG CGG 


ACG 


TAG 


ATA 


ATC 


AGC 


TCA 


CCG 


TTG 


ala ser 


leu 


ala 


leu 


pro 


ser 


ile 


ala 


gly 


ser arg 


thr] AMB file 


ile 


ser 


ser 


pro 


leu 


422/141 


















452/151 












GGA CCG 


ACC 


TCG 


ACC 


AGG 


GGT 


CCT 


TTG 


TGA 


CTG CCG 


GGC 


TTG 


ACG 


CGG 


ACG 


ACC 


ACA 


GAG 


gly pro 


thr 


ser 


thr 


arg 


gly 


pro 


leu) 


OPA (leu. pro 


gly 


leu 


thr 


arg 


thr 


thr 


thr 


glu 


482/161 














512/171 


















' 'TCG' GTC "ATC 


GCC 


TAA 


GGC 


TAC 


CGT 


TCT 


GAC 


CTG GGG 


CTG 


CGT 


GGG 


CGC 


CGA 


CGA 


CGT 


GAG 


ser val 


ile 


ala] 


|OCHYgly 


tyr 


arg 


ser 


asp 


leu gly 


leu 


arg 


gly 


arg 


arg 


arg 


arg 


glu 


542/181 












572/191 


















GCA CGT 


CAT 


GTC 


TCA 


GCG 


GCC 


CAC 


CGC 


CAC 


CTC GGT 


CGC 


CGG 


CAG 


TAT 


GTC 


AGC 


ATG 


TGC 


ala arg 


his 


val 


ser 


ala 


ala 


his 


arg his 


leu gly 


arg 


arg 


gin 


tyr 


val 


ser 


met 


cys 


602/201 


















632/211 


















AGA TGA 


CTC 


CAC 


GCA 


GCC 


TTG 


TTC 


GCA 


TCG 


TTG GTG 


TCG 


TGG 


TTG 


CGA 


CGA 


CCT 


TGG 


CGC 



arg) OPA ("leu his ala ala leu phe ala ser leu val ser trp leu arg arg pro trp arg 

662/221% e «S> <i>K*. 5U3<* 692/231 

TGG TGA GCG CAC CCG CCG GCG GTC GTG CCG CGC ATG CGG ATC 

trp) OPA £ala his pro pro ala val val pro arg met arg ile) 



SEQ ID 222-231 



FIGURE 13B 



39/185 



33/11 



1 Aft. v>SSft»GAT TTC 


GTT 


GCC 




TC1C 


ATT 


GTT 


TGT 


ACG 


GTT TGG 


GAA AAA 


CAC 




AAC^ 


TCC 


rprrirp 
111 


TT A 
1 1 J-\. 




va 1 


ala 


arg 


trp 


ile 


va 1 


cys 


thr 


val trp 


glu lys 


his 


leu 


lvs 


ser 


phe 


leu 




















93/31 
















TTG GCA 


ATG 


CTG 


GAA 


ATG 


GAC 


ATT 


CCA 


ATA 


TTG CGC 


GAA TTA 


ACC 


GAA 


CAC 


GGT 


GAG 


GGG 


leu ala 


met 


leu 


glu 


met 


asp 


ile 


pro 


ile 


leu arg 


glu leu 


thr 


glu 


his 


alv 


glu 


giy 


123/41 


















153/51 
















GGG GCA 


AGC 


GTT 


TGT 


ACC 


GGG 


GCC 


AGC 


AAG 


CGC CGC 


CGA CCG 


GTT 


GAC 


CGA 


AGC 


CAG 


CAT 


alv ala 


ser 


val 


cys 


thr 


gly 


ala 


ser 


lys 


arg arg 


arg pro 


val 


asp 


arg 


ser 


gin 


his 


183/61 


















213/71 
















GTT GTT 


GTG 


TCA 


GCG 


CGG 


GCT 


TGG 


TCT 


CGA 


TGT CCC 


GGC CTT 


GGC 


TGG 


ACC 


CGC 


TTC 


TTC 


va 1 va 1 


val 


ser 


ala 


arg 


ala 


trp 


ser 


arq 


cvs pro 


gly leu 


giy 


trp 


thr 


arg 


phe 


phe 


243/81 


















273/91 
















AAA ACA 


GGT 


TGA 


ACT 


TAA 


CGA 


CTC 


AAG 


AAC 


GGA AAC 


GCT TGA 


ACC 


GCG 


ACG 


TCG 


CTC 


CGG 


lys thr 


giy) 

=3 J/ 


OPA 


thr 


OCHfarg 


leu 


lys 


asn 


gly asn 


ala) OPA (thr 


ala 


thr 


ser 


leu 


arg 


303/101 














333/111 












ACA CCA 


ATT 


TGA 


CTC 


GGC 


TCT 


TTG 


GCA 


ATT 


GAA GGT 


GAG CTG 


CGA 


GCA 


GCC 


GGG 


TGA 


CCG 


thr pro 


ilei 


OPA 


(leu 


gly 


ser 


leu 


ala 


ile 


qIu alv 


glu leu 


arg 


ala 


ala 


giy) 


OPA (pro 


363/121 














393/131 








Sees £>*^> z**3 


CAT CGT 


TGG 


CCT 


TGC 


CAT 


CAA 


TCG 


CCG 


GCT 


CGC GGA 


CGT AGA 


TAA 


TCA 


GCT 


CAC 


CGT 


TGG 


his arg 


trp 


pro 


cys 


his 


gin 


ser 


pro 


ala 


arg ' gly 


arg arg] 


lOCHfeer 


ala 


his 


arg 


trp 


423/141 


















453/151 










GAC CGA 


CCT 


CGA 


CCA 


GGG 


GTC 


CTT 


TGT 


GAC 


TGC CGG 


GCT TGA 


CGC 


GGA 


CGA 


CCA 


CAG 


AGT 


asp arg 


pro 


arg 


pro 


gly 


val 


leu 


cys 


asp 


cys arg 


ala) OPA (arg 


giy 


arg 


pro 


gin 


ser 


— - - ^4 8 3/ 161* 


















513/171' 












CGG TCA 


TCG 


CCT 


AAG 


GCT 


ACC 


GTT 


CTG 


ACC 


TGG GGC 


TGC GTG 


GGC 


GCC 


GAC 


GAC 


GTG 


AGG 


arg ser 


ser 


pro 


lys 


ala 


thr 


val 


leu 


thr 


trp gly 


cys val 


gly 


ala 


asp 


asp 


val 


arg 


543/181 


















573/191 
















CAC GTC 


ATG 


TCT 


CAG 


CGG 


CCC 


ACC 


GCC 


ACC 


TCG GTC 


GCC GGC 


AGT 


ATG 


TCA 


GCA 


TGT 


GCA 


his val 


met 


ser 


gin 


arg 


pro 


thr 


ala 


thr 


ser val 


ala gly 


ser 


met 


ser 


ala 


cys 


ala 


603/201 


















633/211 
















GAT GAC 


TCC 


ACG 


CAG 


CCT 


TGT 


TCG 


CAT 


CGT 


TGG TGT 


CGT GGT 


TGC 


GAC 


GAC 


CTT 


GGC 


GCT 


asp asp 


ser 


thr 


gin 


pro 


cys 


ser 


his 


arg 


trp cys 


arg gly 


cys 


asp 


asp 


leu 


giy 


ala 


663/221 


















693/231 
















GGT GAG 


CGC 


ACC 


CGC 


CGG 


CGG 


TCG 


TGC 


CGC 


GCA TGC 


GGA TC 














gly glu 


arg 


thr 


arg 


arg 


arg 


ser 


cys 


arg 


ala cys 


giy) 















SEQ ID l/lov 23^2^5 
FIGURE 13C 



40/185 





part of 


the 


nucleotide sequence 


of seql3A 


















T71" 
















31/11 


















•ZMV^GGG TCC 


TTT 


GTG ACT 


GCC 


GGG 


CTT 


GAC 


GCG 


GAC GAC 


CAC 


AGA GTC 


GGT 


CAT 


CGC 


CTA 


AGG 


i& ifc Po- 


-*(gly ser 


phe 


val thr 


ala 


gly 


leu 


asp 


ala 


asp asp 
91/31 


his 


arg val 


gly 


his 


arg 


leu 


arg 




CTA CCG 


TTC 


TGA CCT 


GGG 


GCT 


GCG 


TGG 


GCG 


CCG ACG 


ACG 


TGA GGC 


ACG 


TCA 


TGT 


CTC 


AGC 




leu pro 


phe) 


OPAfpro 


gly 


ala 


ala 


trp 


ala 


pro thr 


thrj 


lOPAfgly 
GTG CAG 


thr 


ser 


cys 


leu 


ser 




121/41 












151/51 
















GGC CCA 


CCG 


CCA CCT 


CGG 


TCG 


CCG 


GCA 


GTA 


TGT CAG 


CAT 


ATG 


ACT 


CCA 


CGC 


AGC 




gly pro 


pro 


pro pro 


arg 


ser 


pro 


ala 


val 


cys gin his 


val gin 


met 


thr 


pro 


arg 


ser 




181/61 
















211/71 


















CTT GTT 


CGC 


ATC GTT 


GGT 


GTC 


GTG 


GTT 


GCG 


ACG ACC 


TTG 


GCG CTG 


GTG 


AGC 


GCA 


CCC 


GCC 




leu val 


arg 


ile val 


gly 


val 


val 


val 


ala 


thr thr 


leu 


ala leu 


val 


ser 


ala 


pro 


ala 



241/81 

GGC GGT CGT GCC GCG CAT GCG GAT C 
gly gly arg ala ala his ala asp) 



SEQ ID Kto. 2HCfi ' 2 C I9 



FIGURE 13A f 



1/1 31/11 
lt> ^ ZSOtGGT CCT TTG TGA CTG CCG GGC TTG ACG CGG ACG ACC ACA GAG TCG GTC ATC GCC TAA GGC 
gly pr^eu OPA leu pro gly leu thr arg thr thr thr glu ser val ile ala^OCH^ly 

61/21*^ lD *&.2fet 91/31 5£S> IDAD* 262 

TAC CGT TCT GAC CTG GGG CTG CGT GGG t CGC CGA CGA CGT GAG GCA CGT CAT GTC TCA GCG 

tyr arg ser asp leu gly leu arg gly arg arg arg arg glu ala arg his val ser ala 
121/41 151/51 

GCC CAC CGC CAC CTC GGT CGC CGG CAG TAT GTC AGC ATG TGC AGA TGA CTC CAC GCA GCC 

ala his arg his leu gly arg arg gin tyr val ser met cys arg)0PA^leu his ala ala 
181/61 211/71 S£«> lb Ko. 2€>3 

TTG TTC GCA TCG TTG GTG TCG TGG TTG CGA CGA CCT TGG CGC TGG TGA GCG CAC CCG CCG 

leu phe ala ser leu val ser trp leu arg arg pro trp arg trp\ OPA^ala his pro pro 
241/81 ix>rto.26W 

GCG GTC GTG CCG CGC ATG CGG ATC 
ala val val pro arg met arg ile) 

SEQ ID 250- 26M 



FIGURE 13B f 



1/1 31/11 



IC>*^C>- GTC CTT 


TGT 


GAC 


TGC 


CGG 


GCT TGA 


CGC 


GGA 


CGA CCA 


CAG 


AGT 


CGG 


TCA 


TCG 


CCT 


AAG 


GCT 


^(val leu 


cys 


asp 


cys 


arg 


ala) OPA 


(arg 


gly 


arg pro 


gin 


ser 


arg 


ser 


ser 


pro 


lys 


ala 


i£> 2S<o 61/21 












91/31 
















CGG 


ACC GTT 


CTG 


ACC 


TGG 


GGC 


TGC GTG 


GGC 


GCC 


GAC GAC 


GTG 


AGG 


CAC 


GTC 


ATG 


TCT 


CAG 


thr val 


leu 


thr 


trp 


gly 


cys val 


gly 


ala 


asp asp 


val 


arg 


his 


val 


met 


ser 


gin 


arg 


121/41 
















151/51 


















CCC ACC 


GCC 


ACC 


TCG 


GTC 


GCC GGC 


AGT 


ATG 


TCA GCA 


TGT 


GCA 


GAT 


GAC 


TCC 


ACG 


CAG 


CCT 


pro thr 


ala 


thr 


ser 


val 


ala gly 


ser 


met 


ser ala 


cys 


ala 


asp 


asp 


ser 


thr 


gin 


pro 


181/61 
















211/71 


















TGT TCG 


CAT 


CGT 


TGG 


TGT 


CGT GGT 


TGC 


GAC 


GAC CTT 


GGC 


GCT 


GGT 


GAG 


CGC 


ACC 


CGC 


CGG 


cys ser 


his 


arg 


trp 


cys 


arg gly 


cys 


asp 


asp leu 


gly 


ala 


gly 


glu 


arg 


thr 


arg 


arg 



241/81 

CGG TCG TGC CGC GCA TGC GGA TC 
arg ser cys arg ala cys gly} 



SEQ ID 265-£57 
FIGURE 13C 1 



41/185 



sequence Rvl984c predicted by Cole et al. (Nature 393:537-544) and containing 
seql3A' 

1/1 31/11 

gtt ggt gtc gtg gtt gcg acg acc ttg gcg ctg 
^ w—*- — - - - - val g lv val val val ala tnr tnr leu a l a leu 

^'^•^ewi 9i/3i 

gcg cat gcg gat ccg tgt teg gac ate gcg gtc 
ala his ala asp pro cys ser asp ile ala val 
151/51 

tct ggt ctt ggc gac gtc ggt gag gcg ttc gtc 
ser gly leu gly asp val gly glu ala phe val 
211/71 

egg teg att ggg gtc tac gcg gtg aac tac cca 
arg ser ile gly val tyr ala val asn tyr pro 
271/91 

tea aac ggt tec gat gat gcg age gee cac ate 
ser asn gly ser asp asp ala ser ala his ile 
331/111 

acc agg att gtg ctt ggt ggc tat teg cag ggt 
thr arg' ile val leu gly gly tyr ser gin gly 
391/131 

gcg atg ccg ccc gcg gtg gca gat cat gtc gee 
ala met pro pro ala val ala asp his val ala 
451/151 

agt ggt ttc tec age atg ttg tgg ggc ggc ggg 
ser gly phe ser ser met leu trp gly gly gly 
511/171 

age tct aag acc ata aac ttg tgt get ccc gac 
ser ser lys thr ile asn leu cys ala pro asp 
571/191 

att atg gcg cat gtt teg tat gtt cag teg ggg 
ile met ala his val ser tyr val gin ser gly 
631/211 

gcg aac agg etc gat cac gee gga tga 
ala asn arg leu asp his ala gly) OPA 



►atg act 


cca 


cgc 


age 


ctt 


gtt 


cgc 


ate 


(Met thr 


pro 


arg 


ser 


leu 


val 


arg 


ile 


61/21 
















ata age 


gca 


ccc 


gee 


ggc 


ggt 


cgt 


gee 


val ser 


ala 


pro 


ala 


gly 


gly 


arg 


ala 


121/41 
















gtt ttc 


get 


cgc 


ggc 


acg 


cat 


cag 


get 


val phe 


ala 


arg 


gly 


thr 


his 


gin 


ala 


181/61 
















gac teg 


ctt 


acc 


teg 


caa 


gtt 


ggc 


ggg 


asp ser 


leu 


thr 


ser 


gin 


val 


gly 


gly 


241/81 
















gca age 


gac 


gac 


tac 


cgc 


gcg 


age 


gcg 


ala ser 


asp 


asp 


tyr 


arg 


ala 


ser 


ala 


301/101 
















cag cgc 


acc 


gtc 


gee 


age 


tgc 


ccg 


aac 


gin arg 


thr 


val 


ala 


ser 


cys 


pro 


asn 


361/121 
















gcg acg 


gtc 


ate 


gat 


ttg 


tec 


acc 


teg 


ala thr 


val 


ile 


asp 


leu 


ser 


thr 


ser 


4217141 
















get gtc 


gee 


ctt 


ttc 


ggc 


gag 


cca 


tec 


ala val 


ala 


leu 


phe 


gly 


glu 


pro 


ser 


481/161 
















teg ttg 


ccg 


aca 


ate 


ggt 


ccg 


ctg 


tat 


ser leu 


pro 


thr 


ile 


gly 


pro 


leu 


tyr 


541/181 
















gat cca 


ata 


tgc 


acc 


gga 


ggc 


ggc 


aat 


asp pro 


ile 


cys 


thr 


gly 


gly 


gly 


asn 


601/201 
















atg aca 


age 


cag 


gcg 


gcg 


aca 


ttc 


gcg 


met thr 


ser 


gin 


ala 


ala 


thr 


phe 


ala 



SEQ ID Hj=z>. 2SB- 25ft 
FIGURE 13D 



42/185 



Seql3F: ORF predicted by Cole et al. 
1/1 

i& lT> Ac. ZCcO 1 ^^ <39 c ac< 3 tca tc 3^ ctc a 9 c 99 c cca 

OP^foly thr ser cys leu ser gly pro 

gtg cag atg act cca cgc age ctt gtt 
val gin met thr pro arg ser leu val 
121/41 

gcg ctg gtg age gca ccc gec ggc ggt 
ala leu val ser ala pro ala gly gly 
181/61 

gcg gtc gtt ttc get cgc ggc acg cat 
ala val val phe ala arg gly thr his 
241/81 

ttc gtc gac teg ctt ace teg caa gtt 
phe val asp ser leu thr ser gin val 
301/101 

tac cca gca age gac gac tac cgc gcg 
tyr pro ala ser asp asp tyr arg ala 
361/121 

cac ate cag cgc acc gtc gee age tgc 
his ile gin arg thr val ala ser cys 
. 421/141 

cag ggt gcg acg gtc ate gat ttg tec 
gin gly ala thr val ile asp leu ser 
481/161 

gtc gee get gtc gee ctt ttc ggc gag 
val ala ala val ala leu phe gly glu 
541/181 

ggc ggg teg ttg ccg aca ate. ggt ccg 
gly gly ser leu pro thr ile gly pro 
601/201 

ccc gac gat cca ata tgc acc gga ggc 
pro asp asp pro ile cys thr gly gly 
661/221 

teg ggg atg aca age cag gcg gcg aca 
ser gly met thr ser gin ala ala thr 



(Nature 393:537-544) and containing Rvl984c 
31/11 

ccg cca cct egg teg ccg gca gta tgt cag cat 
pro pro pro arg ser pro ala val cys gin his 
91/31 

cgc ate gtt ggt gtc gtg gtt gcg acg acc ttg 
arg ile val gly val val val ala thr thr leu 
151/51 

cgt gee gcg cat gcg gat ccg tgt teg gac ate 
arg ala ala his ala asp pro cys ser asp ile 
211/71 

cag get tct ggt ctt ggc gac gtc ggt gag gcg 
gin ala ser gly leu gly asp val gly glu ala 
271/91 

ggc ggg egg teg att ggg gtc tac gcg gtg aac 
gly gly arg ser ile gly val tyr ala val asn 
331/111 

age gcg tca aac ggt tec gat gat gcg age gee 
ser ala ser asn gly ser asp asp ala ser ala 
391/131 

ccg aac acc agg att gtg ctt ggt ggc tat teg 
pro asn thr arg ile val leu gly gly tyr ser 
451/151 

acc teg gcg atg ccg ccc gcg gtg gca gat cat 
thr ser ala met pro pro ala val ala asp his 
511/171 

cca tec agt ggt ttc tec age atg ttg tgg ggc 
pro ser ser gly phe ser ser met leu trp gly 
571/191 

ctg tat age tct aag acc ata aac ttg tgt get 
leu tyr ser ser lys thr ile asn leu cys ala 
631/211 

ggc aat att atg gcg cat gtt teg tat gtt cag 
gly asn ile met ala his val ser tyr val gin 
691/231 

ttc gcg gcg aac agg ctc gat cac gec gga tga 
phe ala ala asn arg leu asp his ala gly) OPA 



SEQ ID K£c>. 2<<o- 2<t>t 
FIGURE 13F 



43/185 



31/11 

<S> \t>ite>. 2Ct>2>CCA CCG GGG CTG GAG GGG CGA ATG TGC GCC GAA CGC CGT CGG CCA ACT TGG CCG CTG AGG 
Jbro pro gly leu glu gly arg met cys ala glu arg arg arg pro thr trp pro leu arg 

GCG GCT GAT CCC CTG GCC CGA GAC GGG GCA AGC CAA TAG CGG CTC CAT CGG GCT TTG CTG 
ala ala asp pro leu ala arg asp gly ala ser gln^ AMB^(arg leu his arg ala leu leu 
121/41 151/51 36CS> j G Kc. 2.C* c l 

GTA GCG GTT CGG CGG GAA CCG AGC GCC GAC GTT GTC GGT GCC CGG TGA TAT ATT GGG TCA 
val ala val arg arg glu pro ser ala asp val val gly ala arg)oPA(tyr ile gly ser 
* 181/61 211/71 v ^yu>. Z(mS 

GAC GGG TAT GGC GGC GAC TGA GGT GAT CTG CGA CAC GCC GCC GCG GTG CTC GAG CCA GGC 
asp gly tyr gly gly asp)OPA(gly asp leu arg his ala ala ala val leu glu pro gly 
241/81 Se(Si it> t\P- 20(o 271/91 

TTA CGA CCA GGG AAT TTC GAA AAT GTT ATT CAG AAC ATC TTG TAT CTC TTC CTC CGT GCC 
leu arg pro gly asn phe glu asn val ile gin asn ile leu tyr leu phe leu arg ala 
301/101 331/111 

ACC CCC TAG GTG TAG TGT TTT CGA GTA CCG GCA GAT CCC AGT TCA CCA GTC TCA CCA GAT 
thr pro) AMB val AMB (cys phe arg val pro ala asp pro ser ser pro val ser pro asp) 
*SC«Sj vy^> i^o 2 Op 7 

C 

SEQ ID K/*>. 2jU>2'Z(*7 
FIGURE 14A 



32/11 

lE ,<5> iV> *£> 2fc« CAC CGG GGC TGG AGG GGC GAA TGT GCG CCG AAC GCC GTC GGC CAA CTT GGC CGC TGA GGG 
fhis arq qly trp arg gly glu cys ala pro. asn ala val gly gin leu gly arg)0PA(cjly 

.tjfciS ^ ?^^ 6 2/21 92/31 Se<S> Z~IO 

CGG CTG ATC CCC TGG CCC GAG ACG GGG CAA GCC AAT AGC GGC TCC ATC GGG CTT TGC TGG 
arg leu ile pro trp pro glu thr gly gin ala asn ser gly ser ile gly leu cys trp) 
122/41 152/51 

TAG CGG TTC GGC GGG AAC CGA GCG CCG ACG TTG TCG GTG CCC GGT GAT ATA TTG GGT CAG 
AM&iarg phe gly gly asn arg ala pro thr leu ser val pro gly asp ile leu gly gin 
<5£<* ^^.Z7rT^2/61 212/71 

ACG GGT ATG GCG GCG ACT GAG GTG ATC TGC GAC ACG CCG CCG CGG TGC TCG AGC CAG GCT 
thr gly met ala ala thr glu val ile cys asp thr pro pro arg. cys ser ser gin ala 
242/81 272/91 

TAC GAC CAG GGA ATT TCG AAA ATG TTA TTC AGA ACA TCT TGT ATC TCT TCC TCC GTG CCA 
tyr asp gin gly ile ser lys met leu phe arg thr ser cys ile ser ser ser val pro 
302/101 332/111 

CCC CCT AGG TGT AGT GTT TTC GAG TAC CGG CAG ATC CCA GTT CAC CAG TCT CAC CAG ATC 
pro pro arg cys ser val phe glu tyr arg gin ile pro val his gin ser his gin ile) 



SEQ ID ZU < i-Z'1\ 
FIGURE 14B 



44/185 



33/11 

£(£> lV} V^o. ZH&CC GGG GCT GGA GGG GCG AAT GTG CGC CGA ACG CCG TCG GCC AAC TTG GCC GCT GAG GGC 
^-^jCthr gly ala gly gly ala asn val arg arg thr pro ser ala asn leu ala ala glu gly 
» iT>lAa-2^ 63/21 93/31 



GGC TGA TCC CCT GGC CCG AGA CGG GGC AAG CCA ATA GCG GCT CCA TCG GGC TTT GCT GGT 
gly^OPA Jer pro gly pro arg arg gly lys pro ile ala ala pro ser gly phe ala gly 

123/41 -sec to znu> 153/51 



AGC GGT 


TCG 


GCG 


GGA 


ACC 


GAG CGC CGA 


CGT 


TGT CGG 


TGC 


CCG 


GTG 


ATA 


TAT 


TGG 


GTC 


AGA 


ser gly 


ser 


ala 


gly 


thr 


glu arg arg 


arg 


cys arg 


cys 


pro 


val 


ile 


tyr 


trp 


val 


arg 


183/61 














213/71 


















CGG GTA 


TGG 


CGG 


CGA 


CTG 


AGG TGA TCT 


GCG 


ACA CGC 


CGC 


CGC 


GGT 


GCT 


CGA 


GCC 


AGG 


CTT 


arg val 


trp 


arg 


arg 


leu 


arg^OP^(ser 


ala 


thr arg 


arg 


arg 


gly 


ala 


arg 


ala 


arg 


leu 


243/81 












27-7 273/91 


















ACG ACC 


AGG 


GAA 


TTT 


CGA 


AAA TGT TAT 


TCA 


GAA CAT 


CTT 


GTA 


TCT 


CTT 


CCT 


CCG 


TGC 


CAC 


thr thr 


arg 


glu 


phe 


arg 


lys cys tyr 


ser 


glu his 


leu 


val 


ser 


leu 


pro 


pro 


cys 


his 


303/101 














333/111 


















CCC CTA 


GGT 


GTA 


GTG 


TTT 


TCG AGT ACC 


GGC 


AGA TCC 


CAG 


TTC 


ACC 


AGT 


CTC 


ACC 


AGA 


TC 


pro leu 


gly 


val 


val 


phe 


ser ser thr 


gly 


arg ser 


gin 


phe 


thr 


ser 


leu 


thr 


arg 


) 



SEQ ID P1H- Zll 
FIGURE 14C 



part of the nucleotide sequence of seq!4A 

1/1 31/11 
m i£> MjD. 21*-* TTT T cg AGT ACC GGC AGA TCC CAG GTT CAC CAG GTC TCA CCA GAT C 

iO *a. 2T^-^ phe ser Ser thr gly arg Ser gln Val hiS Val Pr ° 

SEQ ID njZ> 27*'Z74 
FIGURE 14A' 



1/1 31/11 
EiS, it> 2&>^ TGT TTT CGA GTA CCG GCA GAT CCC AGG TTC ACC AGG TCT CAC CAG ATC 
rjb tt> 1U> 2£(-^cys phe arg val pro ala asp pro arg phe thr arg ser his gln ile 

SEQ ID 250- 22/ 

FIGURE 14C 



1/1 31/11 
€£> iDWo-2S2^GTT TTC GAG TAC CGG CAG ATC CCA GGT TCA CCA GGT CTC ACC AGA TC 
as \OK,6.e33^val phe glu tyr arg gln ile pro gly ser pro gly leu thr arg 



SEQ ID VCc^,-2S2- Z^S 
FIGURE 14C 



45/185 



ORF predicted based on the sequence published by Cole et al . (Nature 393:537- 
544) and containing seql4A' 

1/1 31/11 

•JS> »t>K£>. 2gt/ XAG CGG TTC.GGC GGG AAG CTA GCG GCG ACG TTG TCG GTG GCC GGT GAT ATA TTG GGT CAG 

AMB-iarq phe gly gly lys leu ala ala thr leu ser val ala gly asp ile leu gly gin 
\X> VU3.2Z5 -gY/21 91/31 

ACG GGT ATG GCG GCG GCT GAG GTG ATC TGC GAC ACG CCG CCG CGG TGC TCG AGC CAG GCT 

thr qly met ala ala ala glu val ile cys asp thr pro pro arg cys ser ser gin ala 

■ 121/41 151 ' 51 
TAC GAC CAG GGA ATT TCG AAA ATG TTA TTC AGA ACA TCT TGT ATC TCT TCT CCG TGC CAC 
tvr asp gin gly ile ser lys met leu phe arg thr ser cys ile ser ser pro cys his 
181/61 211/71 

CCC CTA GGT GTA GTG TTT TCG AGT ACC GGC AGA TCC CAG GTT CAC CAG GTC TCA CCA gat 
pro leu gly val val phe ser ser thr gly arg ser gin val his gin val ser pro asp 
241/81 271/91 

cca egg ggc gcg atg aac ttc ccg gca teg gca teg cca ggt cga egg acg tgg teg cgc 
pro arg gly ala met asn phe pro ala ser ala ser pro gly arg arg thr trp ser arg 
301/101 331/111 

tat gac ggg aat ctg gag cct tgt egg gee get caa cat ate gaa gat gca eta ctt gag 
tvr asp gly asn leu glu pro cys arg ala ala gin his ile glu asp ala leu leu glu 
361/121 391/131 

teg ttg cca gat cct gtc aga ttc ccg att tec gca aag gag egg tac gec cat gac cgt 
ser leu pro asp pro val arg phe pro ile ser ala lys glu arg tyr ala his asp arg 

— -421/.141. - ■ ■ • ■ - ■■- ...... 

gac cgt tta cac taa 
asp arg leu his) OCH 

SEQ ID K£C>. 28 c i - 2*€> 
FIGURE 14T> 



46/185 



Sequence Rv3054c predicted by Cole et al . (Nature 393:537-544) 
which may be in phase with Seql4A f 



1/1 



31/11 



A> iT^VU> 29k-»gtg tea gat acc aag tec gac ate aaa ate ttg gee tta gtg gga age ctg cgc gcg gcg 
^ ° * ^/val ser asp thr lys ser asp ile lys ile leu ala leu val gly ser leu arg ala ala 
* DlI>Ktf ' ZBr!r 61/21 91/31 

teg ttc aac cgc cag ate gee gag ctg get gec aag gtc get ccg gac ggc gtc acc gtc 
ser phe asn arg gin ile ala glu leu ala ala lys val ala pro asp gly val thr val 
121/41 151/51 

acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aac gaa gac ate gac aca gcg acg 
thr met phe glu gly leu gly asp leu pro phe tyr asn glu asp ile asp thr ala thr 
181/61 211/71 

gag gtg ccg gcg ccg gtg age gcg ttg egg gag gee gcg tct gac gcg cac get gee ttg 
glu val pro ala pro val ser ala leu arg glu ala ala ser asp ala his ala ala leu 
241/81 271/91 

gtg gtc acg ccg gaa tac aac ggc age att ccg gec gtg ate aag aac gcg ate gac tgg 
val val thr pro glu tyr asn gly ser ile pro ala val ile lys asn ala ile asp trp 
301/101 331/111 

ctg tec agg cca ttc ggc gat ggc gcg ttg aag gac aag ccg ttg gee gtg ate ggc ggc 
leu ser arg pro phe gly asp gly ala leu lys asp lys pro leu ala val ile gly gly 
361/121 391/131 

tec atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag act cgc aag teg ttc age ate 
ser met gly arg tyr gly gly val ,trp ala his asp glu thr arg lys ser phe ser ile 
421/141 451/151 

get ggc acg egg gtg gtc gat gcg ate aaa ctg teg gtg ccg ttc caa act ctg ggc aag 
ala gly thr arg val val asp ala ile lys leu ser val pro phe gin thr leu gly lys 
481/161 511/171 

teg gtc gcg gac gac gec ggg ctg gcg gcg aat gtg cgc gac gee gtc ggc aac ttg gee 
ser val ala asp asp ala gly leu ala ala asn val arg asp ala val gly asn leu ala 
541/181 

get gag gtc ggc tga 
ala glu val gly) OPA 



SEQ ID kc&-Z*(e~2S7 
FIGURE 14E 



47/185 



ORF predicted by Cole et al. (Nature 393:537-544) and containing Rv3054c 

1/1 31/11 

'jS>it>Ao 2S&* aa C 9 C 9 at cgg aat aaa tcg gac cat ggt CCg gtt ggC tCg tg ° aag g3C gtg gaC ° aa 
OCH/arg asp arg asn lys ser asp his gly pro val gly ser cys lys asp val asp gin 

5& to ito.2#jSl/21 91/31 

caa gcg gaa agg aac gta gca gtg tea gat acc aag tec gac ate aaa ate ttg gee tta 
gin ala glu arg asn val ala val ser asp thr lys ser asp ile lys ile leu ala leu 
1 121/41 151/51 

gtg gga age ctg cgc gcg gcg tcg ttc aac cgc cag ate gee gag ctg get gee aag gtc 
val gly ser leu arg ala ala ser phe asn arg gin ile ala glu leu ala ala lys val 
181/61 211/71 

get ccg gac ggc gtc acc gtc acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aac 
ala pro asp gly val thr val thr met phe glu gly leu gly asp leu pro phe tyr asn 
241/81 271/91 

gaa gac ate gac aca gcg acg gag gtg ccg gcg ccg gtg age gcg ttg egg gag gec gcg 
glu asp ile asp thr ala thr glu val pro ala pro val ser ala leu arg glu ala ala 
301/101 331/111 

tct gac gcg cac get gec ttg gtg gtc acg ccg gaa tac aac ggc age att ccg gee gtg 
ser asp ala his ala ala leu val val thr pro glu tyr asn gly ser ile pro ala val 
361/121 391/131 

ate aag aac gcg ate gac tgg ctg tec agg cca ttc ggc gat ggc gcg ttg aag gac aag 
ile lys asn ala ile asp trp leu ser arg pro phe gly asp gly ala leu lys asp lys 

421/141 " 451/151 

ccg ttg gec gtg ate ggc ggc tec atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag 
pro leu ala val ile gly gly ser met gly arg tyr gly gly val trp ala his asp glu 
481/161 511/171 

act cgc aag tcg ttc age ate get ggc acg egg gtg gtc gat gcg ate aaa ctg tcg gtg 
thr arg lys ser phe ser ile ala gly thr arg val val asp ala ile lys leu ser val 
541/181 571/191 

ccg ttc caa act ctg ggc aag tcg gtc gcg gac gac gec ggg ctg gcg gcg aat gtg cgc 
pro phe gin thr leu gly lys ser val ala asp asp ala gly leu ala ala asn val arg 
601/201 631/211 
gac gec gtc ggc aac ttg gec get gag gtc ggc tga 
asp ala val gly asn leu ala ala glu val gly) OPA 



SEQ ID iAxc>.2S-2S9 



FIGURE 14F 
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fragment based on the sequence published by Cole et al . (Nature 393:537-54 4) and 
containing seq 14F 1 and seq 14P' 

WlPJ * S 3lG 31/11 

x^S vl->wc. * cgc gat cgg aat aaa tcg gac ca t ggt ccg gtt ggc teg tgc aag gac gtg gac caa 

OCH^rg asp arg asn lys ser asp his gly pro val gly ser cys lys asp val asp gin 

^iDrto. ZQl £sn ala ile gly ile asn arg thr met val arg leu ala arg ala arg thr trp thr asn 

r-fcs iD VU5. 2%"*a(thr arg ser glu)0CH tie gly pro trp ser gly trp leu val gin gly arg gly pro thr 

^6i/2i ' sea id 3< 2 91/31 

c(S> »T>^. Sll caa g C g gaa agg aac gta gca gtg tea gat acc aag tec gac ate aaa ate ttg gee tta 
gin ala glu arg asn val ala val ser asp thr lys ser asp ile lys ile leu ala leu 
lys arg lys gly thr")AMB foln cys gin ile pro ser pro thr ser lys ser trp pro^AMB 
ser gly lys glu arg serVser val arg tyr gin val arg his gin asn leu gly leu ser 
121/41 5eS> lt>A<>. ZQ^ 151/51 

gtg gga age ctg cgc gcg gcg tcg ttc aac cgc cag ate gee gag ctg get gec aag gtc 
val gly ser leu arg ala ala ser phe asn arg gin ile ala glu leu ala ala lys val 
^/trp glu ala cys ala arg arg arg ser thr ala arg ser pro ser trp leu pro arg ser 
ffi> [DKo.3£> * ala arg gly val val gln pro pro asp arg arg ala gly cys gin gly arg 

181/61 211/71 

get ccg gac ggc gtc acc gtc acc atg ttc gag ggg ctg ggg gac ctg ccg ttc tac aac 
ala pro asp gly val thr val thr met phe glu gly leu gly asp leu pro phe tyr asn 
leu arg thr ala ser pro ser pro cys ser arg gly trp gly thr cys arg ser thr thr 
ser gly arg arg his arg his his val arg gly ala gly gly pro ala val leu gin arg 
241/81 271/91 

gaa gac ate gac aca gcg acg gag gtg ccg gcg ccg gtg age gcg ttg egg gag gee gcg 
glu asp ile asp thr ala thr glu val pro ala pro val ser ala leu arg glu ala ala m 
lys thr ser thr gin arg arg arg cys arg arg argpPA^ala arg cys gly arg pro arg 
arg his arg his ser asp gly gly ala gly ala gly gliAarg. val ala gly . gly arg vaj 
301/101 331/111 S£<S> lt> *£>,<3C1 

tct gac gcg cac get gee ttg gtg* gtc 'acg ccg gaa tac aac ggc age att ccg gec gtg- • . 

ser asp ala his ala ala leu val val thr pro glu tyr asn gly ser ile pro ala val 
leu thr arg thr leu pro trp trp ser arg arg asn thr thr ala ala phe arg pro OPA 
— ^ « OPAJarg ala arg cys leu gly gly his ala gly ile gin arg gin his ser gly arg asp 
S££> ID Ko.3I$£j7^\ 391/131 

ate aag aac gcg ate gac tgg ctg tec agg cca ttc ggc gat ggc gcg ttg aag gac aag sfi2 
ile lys asn ala ile asp trp leu ser arg pro phe gly asp gly ala leu lysaspJLys^ r 
^^(ser arg thr arg ser thr gly cys pro gly his ser ala met ala arg) OPA (ar% thr ser 
5<S> 0> &o>3GZ- ^ ln lu arg asp arg leu ala vai gin a la ile arg arg trp arg val glu gly gin ala 
421/141 451/151 

ccg ttg gec gtg ate ggc ggc tec atg ggc cgc tac ggc ggg gta tgg gcg cac gac gag 
pro leu ala val ile gly gly ser met gly arg tyr gly gly val trp ala his asp glu 
arg trp pro\oPA(ser ala ala pro trp ala ala thr ala gly tyr gly arg thr thr arg 
val gly arg as^farg arg leu his gly pro leu arg arg gly met gly ala arg arg asp 
481/161 SCO *E> 304 511/171 

act cgc aag tcg ttc age ate get ggc acg egg gtg gtc gat gcg ate aaa ctg tcg gtg 
thr arg lys ser phe ser ile ala gly thr arg val val asp ala ile lys leu ser val 
leu ala ser arg ser ala ser leu ala arg gly trp ser met arg ser asn cys arg cys 
ser gin val val gin his arg trp his ala gly gly arg cys asp gin thr val gly ala 
541/181 571/191 

ccg ttc caa act ctg ggc aag tcg gtc gcg gac gac gee ggg ctg gcg gcg aat gtg cgc 



a 
arg 



pro phe gin thr leu gly lys ser val ala asp asp ala gly leu ala ala asn val arg 
arg ser lys leu trp ala ser arg ser arg thr thr pro gly trp arg arg met cys al 
val pro asn ser gly gin val gly arg gly arg arg arg ala gly gly glu cys ala a 
601/201 631/211^«S> it>l\0. 292 

gac gee gtc ggc aac ttg gec get gag gtc ggc tga\tcc ctg ggc cga ggc ggg tea gee 
asp ala val gly asn leu ala ala glu val gly) OPA (ser leu gly arg gly gly ser ala 
thr pro ser ala thr trp pro leu arg ser ala asp pro trp ala glu ala gly gin pro 
arg arg arg gin leu gly arg)oPAYgly arg leu ile pro gly pro arg «g val ser glnj 
661/221 S€A VE> V^.3IV 691/231 *5«9 ^ 2*3 

aat age ggc tec ate ggc ttt get ggt age ggt tcg gcg gga age tagffegg cga cgt tgt 
asn ser gly ser ile gly phe ala gly ser gly ser ala gly serjAMff^arg arg arg cys 
ile ala ala pro ser ala leu leu val ala val arg arg glu ala ser gly asp val val 
AMB (irg leu his arg leu cys trp) AMB (arg phe gly gly lys leu ala ala thr leu ser 

SEQ ID HjW>. 2.40-31(0 



FIGURE 14^ 



49/185 



721/241 751/251 lC> * 

cqq tag ccg gtg ata tat tgg gtc aga egg gta tgg egg egg ctg agg tga^tct gcg aca 
arg trp pro val ile tyr trp val arg arg val trp arg arg leu arg)OPA(ser ala thr 
qlv qly arg^OPA fcyr ile gly ser asp gly tyr gly gly gly OPA(gly asp leu arg his 
val ala glV asp^ile leu gly gin thr gly met ala ala ala glu^val ile cys asp thr 
781/261 tS&to lt}ko.3£5 811/271 & te* 3CCp 

cqc cgc cgc ggt get cga gec agg ctt acg ace agg gaa ttt cga aaa tgt tat tea gaa 
arg arg arg gly ala arg ala arg leu thr thr arg glu phe arg lys cys tyr ser glu 
ala ala ala val leu glu pro gly leu arg pro gly asn phe glu asn val ile gin asn 

nro pro arg cys ser ser gin ala tyr asp gin gly ile ser lys met leu phe arg thr 
41/281 871/291 5££> £> Kc. %<& 



cat ctt gta tct ctt etc cgt gee ace ccc tag gtg tagVtgt ttt cga gta ccg gca gat 
his leu val ser leu leu arg ala thr pro) AMB val AMB (cys phe arg val pro ala asp 
ile leu tyr leu phe ser val pro pro pro arg cys ser val phe glu tyr arg gin lie 
ser cys ile ser ser pro cys his pro leu gly val val phe ser ser thr gly arg ser 
901/301 931/311 SCO* iDKo- Z<*(p 

ccc agg ttc acc agg tct cac cag ate cac ggg gcg cga tgaVact tec egg cat egg cat 
pro arg phe thr arg ser his gin ile his gly ala arg]OPAlthr ser arg his arg his 
pro gly ser pro gly leu thr arg ser thr gly arg asp glu leu pro gly lie gly lie 
qln val his gin val ser pro asp pro arg gly ala met asn phe pro ala ser ala ser 
961/321 991/331 

cgc cag gtc gac gga cgt ggt cgc get atg acg gga ate tgg age ctt gtc ggg ccg etc 
arg gin val asp gly arg gly arg ala met thr gly ile trp ser leu val gly pro leu 
ala arg ser thr asp val val ala leu\OPA(arg glu ser gly ala leu ser gly arg ser 
pro gly arg arg thr trp ser arg tyr asr^gly asn leu glu pro cys arg ala ala gin 
1021/341 (D**3fl*T051/351 

aac ata teg aag atg cac tac ttg agt cgt tgc cag ate ctg tea gat tec cga ttt ccg 
asn ile ser lys met his tyr leu ser arg cys gin ile leu ser asp ser arg phe pro 
thr tyr arg arg cys thr thr)oPA(val val ala arg ser cys gin ile pro asp phe arg 
' ' his ile glu asp ala leu leu glu^ser leu pro asp pro val arg phe pro ile ser ala 
1081/361 3£S> i£> 1111/371 

caa agg age ggt acg ccc atg acc gtg acc gtt tac act aa) 
gin arg ser gly thr pro met thr val thr val tyr thr), 
lys gly ala val arg pro) OPA pro OPA(pro phe thr leu 
lys glu arg tyr ala his asp arg aspTarg leu his)OCH 

SEQ ID \U>z- 2<?0"3/<* 

FIGURE 14 (^continued) 



1/1 31/11 
S£^»t>lVt».S«7CAA GCC CGG CCG CGA CTG TTT GCC GTT TTG GGG CTC CTA CCA GAA CAC CAC CTG GCG GCC 
^,I>Kc.3i«ln ala arg pro arg leu phe ala val leu gly leu leu pro glu his his leu ala ala 

61 /21 91/31 

GCG CAC CAT GGT GTG CAC CAG TTG CGA TCG GTT CCT CCC GCG CGC GGG CGG CGA CGA CGT 
ala his his gly val his gin leu arg ser val pro pro ala arg gly arg arg arg arg 

CGa'tGC CCG CGC CCC GGC GGC GCA GCT GCG TAG CTC GAC CCG GTC GAC GAC GAC GGG GTC 
arg cys pro arg pro gly gly ala ala ala^ -^asp P^-asp asp asp gly 

GGC GGA CCA GTC GGC GAT GTC GAG GCG ATG GCA ATA CAG CGC CTT GGT GCG CGG CCA CAC 
qly gly pro val gly asp val glu ala met ala ile gin arg leu gly ala arg pro hxs 

271/91 

GTc'tGA GGT GGC GAA GAC CAG TCC CGC GCC CAC CGG CAG CCG GAT CCG GAT ACG CGG TAC 
vai) OPAjgly gly glu asp gin ser arg ala his arg gin pro asp pro asp thr arg tyr^ 



SEQ ID ft«bfcfT- 320 



FIGURE 15A 
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AAG CCC 


GGC 


CGC 


GAC 


TGT 


TTG 


CCG 


TTT 


lys pro 


gly 


arg 


asp 


cys 


leu 


pro 


phe 


62/21 
















CGC ACC 


ATG 


GTG 


TGC 


ACC 


AGT 


TGC 


GAT 


arg thr 


met 


val 


cys 


thr 


ser 


cys 


asp 


122/41 
















GAT GCC 


CGC 


GCC 


CCG 


GCG 


GCG 


CAG 


CTG 


asp ala 


arg 


ala 


pro 


ala 


ala 


gin 


leu 


182/61 
















GCG GAC 


CAG 


TCG 


GCG 


ATG 


TCG 


AGG 


CGA 


ala asp 


gin 


ser 


ala 


met 


ser 


arg 


arg 


242/81 
















TCT GAG 


GTG 


GCG 


AAG 


ACC 


AGT 


CCC 


GCG 


ser glu 


val 


ala 


lys 


thr 


ser 


pro 


ala 



32/11 

TGG GGC TCC TAC CAG AAC ACC ACC TGG CGG CCG 
trp gly ser tyr gin asn thr thr trp arg pro 
92/31 

CGG TTC CTC CCG CGC GCG GGC GGC GAC GAC GTC 
arg phe leu pro arg ala gly gly asp asp val 
152/51 

CGT AGC TCG ACC CGG TCG ACG ACG ACG GGG TCG 
arg ser ser thr arg ser thr thr thr gly ser 
212/71 

TGG CAA TAC AGC GCC TTG GTG CGC GGC CAC ACG 
trp gin tyr ser ala leu val arg gly his ■ thr 
272/91 

CCC ACC GGC AGC CGG ATC CGG ATA CGC GGT AC 
pro thr gly ser arg ile arg ile arg gl£) 



SEQ ID K££>. 521' 
FIGURE 15B 



33/11 

$C<$) vS>te>323AGC CCG GCC GCG ACT GTT TGC CGT TTT GGG GCT CCT ACC AGA ACA CCA CCT GGC GGC CGC 
a ser pro ala ala thr val cys arg phe gly ala pro thr arg thr pro pro gly gly arg 

Efc vDVlc>32/f 63/ 2i 93/31 

GCA CCA TGG TGT GCA CCA GTT GCG ATC GGT TCC TCC CGC GCG CGG GCG GCG ACG ACG TCG 
ala pro trp cys ala pro val ala ile gly ser ser arg ala arg ala ala thr thr ser 
123/41 153/51 

ATG CCC GCG CCC CGG CGG CGC AGC TGC GTA GCT CGA CCC GGT CGA CGA CGA CGG GGT CGG 
met pro ala pro arg arg arg ser cys val ala arg pro gly arg arg arg arg gly arg 
183/61 213/71 

CGG ACC AGT CGG CGA TGT CGA GGC GAT GGC AAT ACA GCG CCT TGG TGC GCG GCC ACA CGT 
arg thr ser arg arg cys arg gly asp gly asn thr ala pro trp cys ala ala thr arg 
243/81 273/91 

CTG AGG TGG CGA AGA CCA GTC CCG CGC CCA CCG GCA GCC GGA TCC GGA TAC GCG GTA C 
leu arg trp arg arg pro val pro arg pro pro ala ala gly ser gly tyr ala val) 



SEQ ID SZ-b'&ZH 
FIGURE 15C 
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part of the nucleotide sequence of seq!5A 



GGC GGC 


CGC 


GCG 


CCA 


TGG 


TGT 


GCA 


CCA 


GTT 


31/11 
GCG ATC 


GGT 


TCT 


CCC 


GCG 


CGC 


GGG 


CGG 


CGA 


L6>iDKjo326p gly gly 

61/21 


arg 


ala 


pro 


trp 


cys 


ala 


pro 


val 


ala ile 
91/31 


gly 


ser 


pro 


ala 


arg 


gly 


arg 


arg 


CGA CGT 


CGA 


TGG 


CCG 


CGC 


CCC 


GGC 


GGC 


TGC 


AGC TGC 


GTA 


GCT 


CGA 


CCC 


GGT 


CGA 


CGA 


CGA 


arg arg 
121/41 


arg 


trp 


pro 


arg 


pro 


gly 


gly 


cys 


ser cys 
151/51 


val 


ala 


arg 


pro 


gly 


arg 


arg 


arg 


. CGG GGT 


CGG 


CGG 


GCC 


AGT 


CGG 


CGA 


TGT 


CGA 


GGC GAT 


GGC 


AAT 


ACA 


GCG 


CCT 


TGG 


TGC 


GCG 


arg gly 
181/61 


arg 


arg 


ala 


ser 


arg 


arg 


cys 


arg 


gly asp 
211/71 


gly 


asn 


thr 


ala 


pro 


trp 


cys 


ala 


GCC ACA 


CGT 


CTG 


AGG 


TGG 


CGA 


AGA 


CCA 


GTC 


CCG CGC 


CCA 


CCG 


GCA 


GCC 


GGA 


TC 






ala thr 


arg 


leu 


arg 


trp 


arg 


arg 


pro 


val 


pro arg 


pro 


pro 


ala 


ala 


gly 









SEQ ID koz> 32$' 32C& 
FIGURE ISA' 



1/1 31/11 

;£j&>it> **>327 GCG GCC GCG CGC CAT GGT GTG CAC CAG TTG CGA TCG GTT CTC CCG CGC GCG GGC GGC GAC 

''C; '' , ;u (} , /ala -ala' 'ala-'arg his gly val his* gin leu arg ser val leu pro arg ala gly gly asp 

^^"° 32 * 61/21 91/31 

GAC GTC GAT GGC CGC GCC CCG GCG GCT GCA GCT GCG TAG CTC GAC CCG GTC GAC GAC GAC 

asp val asp gly arg ala pro ala ala ala ala ala^AMsXleu asp pro val asp asp asp 

121/41 151/51 S£G> i£> K0^29 

GGG GTC GGC GGG CCA GTC GGC GAT GTC GAG GCG ATG GCA ATA CAG CGC CTT GGT GCG CGG 

gly val gly gly pro val gly asp val glu ala met ala ile gin arg leu gly ala arg 

181/61 211/71 

CCA CAC GTC TGA GGT GGC GAA GAC CAG TCC CGC GCC CAC CGG CAG CCG GAT C 

pro his val) OPA ^jly gly glu asp gin ser arg ala his arg gin pro asp^ 

SC<$ ID Kx>. 330 

SEQ ID KnS . 32 7 ~s33C> 
FIGURE 15B' 



1/1 31/11 
f>£<& \T>*i>.33tGG CGG CCG CGC GCC ATG GTG TGC ACC AGT TGC GAT CGG TTC TCC CGC GCG CGG GCG GCG 

Lt> Ko SS2 tr P ar 9 P ro arg ala met va ^ C ^ S thr ser cys asp arg phe Ser arg ala arg ala ala 

^ * 61/21 91/31 

ACG ACG TCG ATG GCC GCG CCC CGG CGG CTG CAG CTG CGT AGC TCG ACC CGG TCG ACG ACG 

thr thr ser met ala ala pro arg arg leu gin leu arg ser ser thr arg ser thr thr 

121/41 151/51 

ACG GGG TCG GCG GGC CAG TCG GCG ATG TCG AGG CGA TGG CAA TAC AGC GCC TTG GTG CGC 

thr gly ser ala gly gin ser ala met ser arg arg trp gin tyr ser ala leu val arg 

181/61 211/71 

GGC CAC ACG TCT GAG GTG GCG AAG ACC AGT CCC GCG CCC ACC GGC AGC CGG ATC 

gly his thr ser glu val ala lys thr ser pro ala pro thr gly ser arg ile 



SEQ ID Ro->. 331-332 
FIGURE 15C f 
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ORF containing SeqlSA' according to 

<£> <D lto.3S3 i/ 1 

taa ggt ccg cca acg ctt tac get cga 

e& a") VVE.S3H q^ pyalv pro pro thr leu tyr ala arg 

61/21 

cgt agt cgc cgc agg gca ggg ctt ccc 
arg ser arg arg arg ala gly leu pro 
121/41 

agg ggt age gtt cgt ggg cgt cga cga 
arg gly ser val arg gly arg arg arg 
181/61 

egg tgg ggg tgc gca cgc ccg gcc gcg 
arg trp gly cys ala arg pro ala ala 
241/81 

cca cct ggc ggc cgc gcg cca tgg tgt 
pro. pro gly gly arg ala pro trp cys 
301/101 

egg cga cga cgt cga tgg ccg cgc ccc 
arg arg arg arg arg trp pro arg pro 
361/121 

cga cga egg ggt egg egg gcc agt egg 
arg arg arg gly arg arg ala ser arg 
■ , ■ 421/141 

tgc gcg gcc aca cgt ctg agg tgg cga 
cys ala ala thr arg leu arg trp arg 
481/161 

ggt agg gca ggc gcg agt ctt cag egg 
gly arg ala gly ala ser leu gin arg 
541/181 

agg gta egg gcg gcg tac ggc aac ggt 
arg val arg ala ala tyr gly asn gly 
601/201 

cga agg ggc agg tga 
arg arg gly arg)OPA 



Cole et al. (Nature 393:537-544) 
31/11 

egg ccg cca cga gtt ggc egg cca ctt tea ggc 
arg pro pro arg val gly arg pro leu ser gly 
91/31 

gcg teg tct teg egg gtt tgt egg caa agg tgt 
ala ser ser ser arg val cys arg gin arg cys 
151/51 

cga tgt gca get egg gga tgc egg egg cgc ggg 
arg cys ala ala arg gly cys arg arg arg gly 
211/71 

act gtt tgc gcg ttt tgg ggc tct gcc aga aca 
thr val cys ala phe trp gly ser ala arg thr 
271/91 

gca cca gtt gcg ate ggt tct ccc gcg cgc ggg 
ala pro val ala ile gly ser pro ala arg gly 
331/111 

ggc ggc tgc age tgc gta get cga ccc ggt cga 
gly gly cys ser cys val ala arg pro gly arg 
391/131 

cga tgt cga ggc gat ggc aat aca gcg cct tgg 
arg cys arg gly asp gly asn thr ala pro trp 
451/151 

aga cca gtc ccg cgc cca ccg gca gcc gga tea 
arg pro val pro arg pro pro ala ala gly ser 
511/171 

ggt tgg egg cga cga gca get cca cag agt gtg 
gly trp arg arg arg ala ala pro gin ser val 
571/191 

gaa gca ggc act ccg acg aac cca teg tea cgt 
glu ala gly thr pro thr asn pro ser ser arg 



SEQ ID IA^>* 
FIGURE 150 
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R:Rv2530c predicted according to Cole et ai . (Nature 393:537-544) and which may 
be in phase with SEQ15A 

>e&iO(Ao- 1/1 31/11 

*5t3£gtg acg gca ctg etc gat gtc aat gtg ctg ate gcg ctg ggc tgg ccg aat cac gtt cac 

vfi> iC> Wo3?dyal thr ala leu leu asp val asn val leu ile ala leu gly trp pro asn his val his 

61/21 91/31 

cat gcg gcc gcg cag cga tgg ttc acg cag ttc tec teg aat ggg tgg gee acc acg ccg 
his ala ala ala gin arg trp phe thr gin phe ser ser asn gly trp ala thr thr pro 
121/41 151/51 

ate acc gag gca ggg tat gtc cga att tea age aat cgc agt gtg atg cag gtg teg acc 
■ ile thr glu ala gly tyr val arg ile ser ser asn arg ser val met gin val ser thr 
181/61 211/71 

acg ccg get ate gcg ate get cag ttg gcg gcg atg act tct ctt gec ggg cac acg ttt 
thr pro ala ile ala ile ala gin leu ala ala met thr ser leu ala gly his thr phe 
241/81 271/91 

tgg cct gac gat gtg cca ctg ate gtt ggg age gec ggc gat cgc gat gcg gtg tec aac 
trp pro asp asp val pro leu ile val gly ser ala gly asp arg asp ala val ser asn 
301/101 331/111 

cac cgt egg gtc acc gac tgc cat etc ate gec ttg gee gcg cgc tac ggg ggc egg ttg 
his arg arg val thr asp cys his leu ile ala leu ala ala arg tyr gly gly arg leu 
361/121 391/131 

gtc aca ttc gat gec gca ctg gee gat tea gca tec gca ggc etc gtc gag gtg ttg. tag 
val thr phe asp ala ala leu ala asp ser ala ser ala gly leu val glu val leu)AMB 

SEQ ID «c6.AS6'33<tf> 
FIGURE 15.E 



SeqlSP: ORF according to Cole et al. (Nature 393:537-544) containing Rv2530c 
1/1 31/11 
$CJ$>VO -337 t<5a tgt tcc gcc gga tgc gcc gac ggt gac ttc cga gga tgt cgt ccg cgc gct cga gga 

OP A / cys ser ala gly cys ala asp gly asp phe arg gly cys arg pro arg ala arg gly 
£C5> »0^.s^61/21 91/31 

cga cgt gtg acg gca ctg etc gat gtc aat gtg ctg ate gcg ctg ggc tgg ccg aat cac 
arg arg val thr ala leu leu asp val asn val leu ile ala leu gly trp pro asn his 
121/41 151/51 

gtt cac cat gcg gcc gcg cag cga tgg ttc acg cag ttc tcc teg aat ggg tgg gcc acc 
val his his ala ala ala gin arg trp phe thr gin phe ser ser asn gly trp ala thr 
181/61 211/71 

acg ccg ate acc gag gca ggg tat gtc cga att tea age aat cgc agt gtg atg cag gtg 
thr pro ile thr glu ala gly tyr val arg ile ser ser asn arg ser val met gin val 
241/81 271/91 

teg acc acg ccg gct ate gcg ate gct cag ttg gcg gcg atg act tct ctt gcc ggg cac 
ser thr thr pro ala ile ala ile ala gin leu ala ala met thr ser leu ala gly his 
301/101 331/111 

acg ttt tgg cct gac gat gtg cca ctg ate gtt ggg age gcc ggc gat cgc gat gcg gtg 
thr phe trp pro asp asp val pro leu ile val gly ser ala gly asp arg asp ala val 
361/121 391/131 

tcc aac cac cgt egg gtc acc gac tgc cat etc ate gcc ttg gcc gcg cgc tac ggg ggc 
ser asn his arg arg val thr asp cys his leu ile ala leu ala ala arg tyr gly gly 
421/141 451/151 

egg ttg gtc aca ttc gat gcc gca ctg gcc gat tea gca tcc gca ggc etc gtc gag gtg 

arg leu val thr phe asp ala ala leu ala asp ser ala ser ala gly leu val glu val 

481/161 

ttg tag 

leu\AMB 

* SEQ ID 337-33^ 
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Fragment containing SeqlSP 1 and Seq 15F f 

i/ise&tb^SUk se<s \t> *o 3M7 31/11 

lfi> lDV^32fi - tga tgt tec gec gga tgc gec gac ggt gac ttc cga gga tgt cgt ccg cgc get cga gga 
OPAJcys ser ala gly cys ala asp gly asp phe arg gly cys arg pro arg ala arg gly 

to ""~^p val pro pro asp ala pro thr val thr ser glu asp val val arg ala leu glu asp 

,Met phe arg arg met arg arg arg)oPA/leu pro arg met ser ser ala arg ser arg thr 

»fl>*0.3V* 61 /2i ^cfc lfc 91/31 

cga cgt gtg acg gca ctg etc gat gtc aat gtg ctg ate gcg ctg ggc tgg ccg aat cac 
arg arg val thr ala leu leu asp val asn val leu ile ala leu gly trp pro asn his 
asp val OPA arg his cys ser met ser met cys OPA ser arg trp ala gly arg ile thr 
thr cys asp gly thr ala arg cys gin cys ala asp arg ala gly leu ala glu ser arg 
121/41 151/51 

gtt cac cat gcg gec gcg cag cga tgg ttc acg cag ttc tec teg aat ggg tgg gec acc 
val his his ala ala ala gin arg trp phe thr gin phe ser ser asn gly trp ala thr 
phe thr met arg pro arg ser asp gly ser arg ser ser pro arg met gly gly pro pro 
ser pro cys gly arg ala ala met val his ala val leu leu glu trp val gly his his 
181/61 211/71 

acg ccg ate acc gag gca ggg tat gtc cga att tea age aat cgc agt gtg atg cag gtg 
thr pro ile thr glu ala gly tyr val arg ile ser ser asn arg ser val met gin val 
arg arg ser pro arg gin gly met ser glu phe gin ala ile ala val OPA cys arg cys 
ala asp his arg gly arg val cys pro asn phe lys gin ser gin cys asp ala gly val 
241/81 271/91 

teg acc acg ccg get ate gcg ate get cag ttg gcg gcg atg act tct ctt gee ggg cac 
ser thr thr pro ala ile ala ile ala gin leu ala ala met thr ser leu ala gly his 
arg pro arg arg leu ser arg ser leu ser trp arg arg OPA leu leu leu pro gly thr 

asp his ala gly tyr arg asp arg ser val gly gly asp asp phe ser cys arg ala his 
301/101 , , 331/111 „ , , 

acg ttt tgg cct gac gat gtg cca ctg ate gtt ggg age gee ggc gat cgc gat gcg gtg 
thr phe trp pro asp asp val pro leu ile val gly ser ala gly asp arg asp ala val 
arg phe gly leu thr met cys his OPA ser leu gly ala pro ala ile ala met arg cys 

val leu ala) OPA (arg cys ala thr asp arg trp glu arg arg arg ser arg cys gly val 
361/121 391/131 

tec aac cac cgt egg gtc acc gac tgc cat etc ate gec ttg gec gcg cgc tac ggg ggc 
ser asn his arg arg val thr asp cys his leu ile ala leu ala ala arg tyr gly gly 
pro thr thr val gly ser pro thr ala ile ser ser pro trp pro arg ala thr gly ala 
gin pro pro ser gly his arg leu pro ser his arg leu gly arg ala leu arg gly pro 
421/141 451/151 

egg ttg gtc aca ttc gat gee gca ctg gee gat tea gca tec gca ggc etc gtc gag gtg 
arg leu val thr phe asp ala ala leu ala asp ser ala ser ala gly leu val glu val 
gly trp ser his ser met pro his trp pro ile gin his pro gin ala ser ser arg cys 
val gly his ile arg cys arg thr gly arg phe ser ile arg arg pro arg arg gly val 
481/161 .s£a> \t> 511/171 

' - gcg ccc 

ala pro 

rys set his arg giy crp aid axa diy ym <n<i w-" j — -.^ *irg arg pro 

"val val thr gly asp gly arg leu ala arg pro ala gly ser ala gly ala gly ala pro 
541/181 571/191 

ccg gtc gga cac egg cag gee gac get ttt ggc cca cgc gcg cag etc ggc get get ggg 
pro val gly his arg gin ala asp ala phe gly pro arg ala gin leu gly ala ala gly 
arg ser asp thr gly arg pro thr leu leu ala his ala arg ser ser ala leu leu gly 
gly arg thr pro ala gly arg arg phe trp pro thr arg ala ala arg arg cys trp ala 
601/201 631/211 

etc ggg etc ggc ggc age egg etc gaa aac cgt ggt ggc gtc ggc ate gtc gac gaa cca 
leu gly leu gly gly ser arg leu glu asn arg gly gly val gly ile val asp glu pro 
ser gly ser ala ala ala gly ser lys thr val val ala ser ala ser ser thr asn gin 
arg ala arg arg gin pro ala arg lys pro trp trp arg arg his arg arg arg thr arg^ 




SEQ ID Kxx>- 
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661/221 &ZC$>\^>\AX>-&i2. 691/231 

ggt gag ggc ggc ggc tag^ata gcg gta ggt gta ttc ctg ggc gag ctt gcg ggt ttg gca 
gly glu gly gly gly^AMBCile ala val gly val phe leu gly glu leu ala gly leu ala 
val arg ala ala ala arg AMB arg AMB val tyr ser trp ala ser leu arg val trp gin 
i£> iAo.JEy^QE^gly arg arg leu asp ser gly arg cys ile pro gly arg ala cys gly phe gly arg 
721/241 751/251 . 

gaa cac gat egg cac gtt ggg aaa gcc gat ctg caa ttc ggc cag ccc ate ggc gat cgc 
glu his asp arg his val gly lys ala asp leu gin phe gly gin pro ile gly asp arg 
asn thr ile gly thr leu gly lys pro ile cys asn ser ala ser pro ser ala ile ala 

thr arg ser ala arg trp glu ser arg ser ala ile arg pro ala his arg arg ser pro 
781/261 811/271 ' 

cgt egg gcg ggc gaa gga gtg cgc gaa gat etc cga gta gcg gtc etc gac cac cac ggc 
arg arg ala gly glu gly val arg glu asp leu arg val ala val leu asp his his gly 
val gly arg ala lys glu cys ala lys ile ser glu AMB arg ser ser thr thr thr ala 

ser gly gly arg arg ser ala arg arg ser pro ser ser gly pro arg pro pro arg arg 
841/281 871/291 

ggc ccg tgg cag cgc ggc cag ttc ggt cag ttg gta ttt cag gtt gcc gtt cag cac gcc 
gly pro trp gin arg gly gin phe gly gin leu val phe gin val ala val gin his ala 
ala arg gly ser ala ala ser ser val ser trp tyr phe arg leu pro phe ser thr pro 
pro val ala ala arg pro val arg ser val gly ile ser gly cys arg ser ala arg gin 
901/301 931/311 

aga agt aag gtc cgc caa cgc ttt acg etc gac ggc cgc cac gag ttg gcc ggc cac ttt 
arg ser lys val arg gin arg phe thr leu asp gly arg his glu leu ala gly his phe . 
glu val arg ser ala asn ala leu arg ser thr ala ala thr ser trp pro ala thr phe . 
lys) OCH/gly pro pro thr leu tyr ala arg arg pro pro arg val gly arg pro leu ser 
961/321 O>*to. 3£>£. 991/331 

, ,cag ; gcc gta, gtc gcc gca ggg cag ggc ttc ccg cgt cgt ctt cgc ggg ttt gtc ggc aaa 

gin ala val val ala ala gly gin gly phe pro arg arg 'leu arg gly phe val gly lys 
arg pro AMB ser pro gin gly arg ala ser arg val val phe ala gly leu ser ala lys 
gly arg ser arg arg arg ala gly leu pro ala ser ser ser arg val cys arg gin arg 
1021/341 1051/351 

ggt gta ggg gta gcg ttc gtg ggc gtc gac gac gat gtg cag etc ggg gat gcc ggc ggc 
gly val gly val ala phe val gly val asp asp asp val gin leu gly asp ala gly gly 
val AMB gly AMB arg ser trp ala ser thr thr met cys ser ser gly met pro ala ala 
cys arg gly ser val arg gly arg arg arg arg cys ala ala arg gly cys arg arg arg 
1081/361 1111/371 

gcg ggc ggt ggg ggt gcg cac gcc egg ccg cga ctg ttt gcg cgt ttt ggg get ctg cca 
ala gly gly gly gly ala his ala arg pro arg leu phe ala arg phe gly ala leu pro 
arg ala val gly val arg thr pro gly arg asp cys leu arg val leu gly leu cys gin 
gly arg trp gly cys ala arg pro ala ala thr val cys ala phe trp gly ser ala arg 
1141/381 1171/391 

gaa cac cac ctg gcg gcc gcg cgc cat ggt gtg cac cag ttg cga teg gtt etc ccg cgc 
glu his his leu ala ala ala arg his gly val his gin leu arg ser val leu pro arg 
asn thr thr trp arg pro arg ala met val cys thr ser cys asp arg phe ser arg ala 

thr pro pro gly gly arg ala pro trp cys ala pro val ala ile gly ser pro ala arg 
1201/401 1231/411 SCQtS>te-3<1& 

gcg ggc ggc gac gac gtc gat ggc cgc gcc ccg gcg get gca get gcg tagfctc gac ccg 
ala gly gly asp asp val asp gly arg ala pro ala ala ala ala alajAMB^Leu asp pro 
arg ala ala thr thr ser met ala ala pro arg arg leu gin leu arg ser ser thr arg 

gly arg arg arg arg arg trp pro arg pro gly gly cys ser cys val ala arg pro gly 
1261/421 1291/431 

gtc gac gac gac ggg gtc ggc ggg cca gtc ggc gat gtc gag gcg atg gca ata cag cgc 
val asp asp asp gly val gly gly pro val gly asp val glu ala met ala ile gin arg 
ser thr thr thr gly ser ala gly gin ser ala met ser arg arg trp gin tyr ser ala 
arg arg arg arg gly arg arg ala ser arg arg cys arg gly asp gly asn thr ala pro 



SEQ ID 33°t- S52 
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1321/441 1351/451 

ctt ggt gcg egg cca cac gtc tga^ggt ggc gaa gac cag tec cgc gec cac egg cag ccg 
leu gly ala arg pro his val)OPA(gly gly glu asp gin ser arg ala his arg gin pro 
leu val arg gly his thr ser glu val ala lys thr ser pro ala pro thr gly ser arg 
trp cys ala ala thr arg leu arg trp arg arg pro val pro arg pro pro ala ala gly 
1381/461 1411/471 

gat cag gta ggg cag gcg cga gtc ttc age ggg gtt ggc ggc gac gag cag etc cac aga 
asp gin val gly gin ala arg val phe ser gly val gly gly asp glu gin leu his arg 
ile arg AMB gly arg arg glu ser ser ala gly leu ala ala thr ser ser ser thr glu 
ser gly arg ala gly ala ser leu gin arg gly trp arg arg arg ala ala pro gin ser 
1441/48l5 £(S> t I> /(jo. 1471/491 

gtg tga&ggg tac ggg egg cgt acg gca acg gtg aag cag gca etc cga cga ace cat cgt 
val)oPA^gly tyr gly arg arg thr ala thr val lys gin ala leu arg arg thr his arg 
cys glu gly thr gly gly val arg gin arg OPA ser arg his ser asp glu pro ile val 
val arg val arg ala ala tyr gly asn gly glu ala gly thr pro thr asn pro ser ser 
1501/501 

cac gtc gaa ggg gca ggt ga) 
his val glu gly ala gly) 
thr ser lys gly gin val 
arg arg arg gly arg) OPA 



SEQ ID :lA£fi.333-3S2( continued 2) 
FIGURE 15^ (continued (2) 



31/11 

&$> S>i^c>-2&3 TGC GCA TGC CGA CCA GTG TGG TTG GCC GGA GTT CGT TTG TTC GCG ATT GCC TCA ACG ATT 
Jcys ala cys arg pro val trp leu ala gly val arg leu phe ala ile ala ser thr ile 

:<S>fbK*. 3W% 1/21 91/31 

CGA TAT AAC CAC TCT AGT CAC ATC AAC CAC ACT CGT ACC ATC GAG CGT GTG GGT TCA TGC 
arg tyr asn his ser ser his ile asn his thr arg thr ile glu arg val gly ser cys 
121/41 151/51 

CAT GCA TTC GCG ACC GCG GGA GCC GGC GAA CCC GGC GCC ACA CAT AAT CCA GAT TGA GGA 
his ala phe ala thr ala gly ala gly glu pro gly ala thr his asn pro asp) OPAjfgly 
181/61 211/71 ^©u>Ko. 3fi6 

GAC TTC CGT GCC GAA CCG ACG CCG ACG CAA GCT TTC GAC AGC CAT GAG CGC GGT CGC CGC 
asp phe arg ala glu pro thr pro thr gin ala phe asp ser his glu arg gly arg arg 
241/81 271/91 

CCT GGC AGT TGC AAG TCC TTG TGC ATA TTT TCT TGT CTA CGA ATC AAC CGA AAC GAC CGA 
pro gly ser cys lys ser leu cys ile phe ser cys leu arg ile asn arg asn asp arg 
301/101 331/111 

GCG GCC CGA GCA CCA TGA ATT CAA GCA GGC GGC GGT GTT GAC CGA CCT GCC CGG CGA GCT 

ala ala arg ala pro)OPA(ile gin ala gly gly gly val asp arg pro ala arg arg ala 

361/121 sc^ib"^ 391/131 

GAT GTC CGC GCT ATC GCA GGG GTT GTC CCA GTT CGG GAT C 

asp val arg ala ile ala gly val val pro val arg asp^ 



SEQ ID Kcs.353-3ex^ 
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357 32/11 

3S> iOUU>* GCG CAT GCC GAC CAG TGT GGT TGG CCG GAG TTC GTT TGT TCG CGA TTG CCT CAA CGA TTC 

^(ala his ala asp gin cys gly trp pro glu phe val cys ser arg leu pro gin arg phe 

-4&-B>*»-»» 62 /2i 92/31 

GAT ATA ACC ACT CTA GTC ACA TCA ACC ACA CTC GTA CCA TCG AGC GTG TGG GTT CAT GCC 

asp ile thr thr leu val thr ser thr thr leu val pro ser ser val trp val his ala 

122/41 152/51 

ATG CAT TCG CGA CCG CGG GAG CCG GCG AAC CCG GCG CCA CAC ATA ATC CAG ATT GAG GAG 

■ met his ser arg pro arg glu pro ala asn pro ala pro his ile ile gin ile glu glu 

182/61 212/71 

ACT TCC GTG CCG AAC CGA CGC CGA CGC AAG CTT TCG ACA GCC ATG AGC GCG GTC GCC GCC 

thr ser val pro asn arg arg arg arg lys leu ser thr ala met ser ala val ala. ala 

242/81 272/91 

CTG GCA GTT GGA AGT CCT TGT GCA TAT TTT CTT GTC TAC GAA TCA ACC GAA ACG ACC GAG 

leu ala val ala ser pro cys ala tyr phe leu val tyr glu ser thr glu thr thr glu 

302/101 332/111 

CGG. CCC GAG CAC CAT GAA TTC AAG CAG GCG GCG GTG TTG ACC GAC CTG CCC GGC GAG CTG 

arg pro glu his his glu phe lys gin ala ala val leu thr asp leu pro gly glu leu 

362/121 392/131 
ATG TCC GCG CTA TCG CAG GGG TTG TCC CAG TCC GGG ATC 
met ser ala leu ser gin gly leu ser gin phe gly ile} 

, ; ... v , , , , , ( . SEQ ID 1^5 .3^-7-35^ , „ 

FIGURE 16B 



K^rVo.SS^ 33/11 

>C^V AT( , GTG GGC CGG AGT TCG GTT GAT TGC AAC GAT TCG 

^-Ct> i£> Ao. 3<o0^rg met pro thr ser val val gly arg ser ser phe val arg asp cys leu asn asp ser 
63/21 93/31 

ATA TAA CCA CTC TAG TCA CAT CAA CCA CAC TCG TAC CAT CGA GCG TGT GGG TTC ATG CCA 
ile^OCH pro leu AME^ser his gin pro his ser tyr his arg ala cys gly phe met pro 
123/41 3^<S> 153/51 

TGC ATT CGC GAC CGC GGG AGC CGG CGA ACC CGG CGC CAC ACA TAA TCC AGA TTG AGG AGA 
cys ile arg asp arg gly ser arg arg thr arg arg his thr)oCH^ser arg leu arg arg 
183/61 213/71 3£<S? |b /Lo. 927 

CTT CCG TGC CGA ACC GAC GCC GAC GCA AGC TTT CGA CAG CCA TGA GCG CGG TCG CCG CCC 
leu pro cys arg thr asp ala asp ala ser phe arg gin pro^OPA^ala arg ser pro pro 
243/81 273/91 S££> lb K£>, Q 

TGG CAG TTG CAA GTC CTT GTG CAT ATT TTC TTG TCT ACG AAT CAA CCG AAA CGA CCG AGC 
trp gin leu gin val leu val his ile phe leu ser thr asn gin pro lys arg pro ser 
303/101 333/111 

GGC CCG AGC ACC ATG AAT TCA AGC AGG CGG CGG TGT TGA CCG ACC TGC CCG GCG AGC TGA 
gly pro ser thr met asn ser ser arg arg arg cys) OPA (pro thr cys pro ala ser) OPA 
363/121 393/131 S€(S> lb KO. 

TGT CCG CGC TAT CGC AGG GGT TGT CCC AGT TCG GGA TC 
(cys pro arg tyr arg arg gly cys pro ser ser gly) 

SEQ ID Kos>.35fl-3aA 926-930 
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;£U > QCG GGC 


CAC 


CGA 


TCA 


GTC 


GAT 


CGG 


GTG 


£>\fc>.<Si^ala gly 


his 


arg 


ser 


val 


asp 


arg 


val 


61/21 
















GCA GTG 


ACG 


ACA 


CCA 


GCG 


CAG 


GAC 


GCG 


ala val 


thr 


thr 


pro 


ala 


gin 


asp 


ala 


121/41 
















GGC TCG 


CCT 


TTT 


TTT 


CAT 


CAA 


CGT 


TGG 


gly ser 


pro 


phe 


phe 


his 


gin 


arg 


trp 


181/61 
















TTC GGT 


CAC 


CTG 


ACG 


GTC 


GGG 


ATG 


TTC 


phe gly 


his 


leu 


thr 


val 


gly 


met 


phe 


241/81 
















TGC CCT 


GCT 


GGT 


GCG 


GCG 


TTC 


GGC 


CGA 


cys pro 


ala 


gly 


ala 


ala 


phe 


gly 


arg 


301/101 
















GTC GAT 


GGC 


CCT 


CAA 


CTC 


GGC 


ATC 


GCG 


val asp 


gly 


pro 


gin 


leu 


gly 


ile 


ala 





31/11 


















GTT 


TCC GCT 


CCA 


TCA 


GCC 


CGG 


AAT 


TGA 


GGT 


GCC 


val 


ser ala 


pro 


ser 


ala 


arg 


asn) OPA (gly 


ala 




91/31 












CCG 


TTG GTG 


TTT 


CCC 


TCT 


GTT 


GCT 


TTC 


CCG 


TCC 


pro 


leu val 


phe 


pro 


ser 


val 


ala 


phe 


pro 


ser 




151/51 


















ACT 


GCC GCA 


GTG 


GCG 


ATG 


TTG 


GTC 


GCC 


GGC 


GTG 


thr 


ala ala 


val 


ala 


met 


leu 


val 


ala 


gly 


val 




211/71 


















CTT 


GGG TCT 


CGG 


GTT 


GCT 


GCT 


GGG 


TTT 


GCT 


CAA 


leu 


gly ser 


arg 


val 


ala 


ala 


gly 


phe 


ala 


gin 




271/91 


















GTC 


GAT CAC 


CGC 


CAA 


AGA 


GCA 


CCC 


GTT 


AAA 


ACG 


val 


asp his 


arg 


gin 


arg 


ala 


pro 


val 


lys 


thr 




331/111 


















ACT 


GGC GAT 


TAT 


CAC 


CAT 


GCC 


TCG 


GGC 


TGA 


TC 


thr 


gly asp 


tyr 


his 


his 


ala 


ser 


gly) OPA 





SEQ ID Ao&. &o( - 3^> 
FIGURE 17A 



32/11 

3(*V-^CGG GCC ACC GAT CAG TCG ATC GGG TGG TTT CCG CTC CAT CAG CCC GGA ATT GAG GTG CCG 
^ o. * /jC /arg ala thr asp gin ser ile gly trp phe pro leu his gin pro gly ile glu val pro 
S>A*"5C£V 62/21 92/31 

CAG TGA CGA CAC CAG CGC AGG ACG CGC CGT TGG TGT TTC CCT CTG TTG CTT TCC CGT CCG 
gin) OPA (arg his gin arg arg thr arg arg trp cys phe pro leu leu leu ser arg pro 
122741^^ x ^^3<o^ 152/51 

GCT CGC CTT TTT TTC ATC AAC GTT GGA CTG CCG CAG TGG CGA TGT TGG TCG CCG GCG TGT 
ala arg leu phe phe ile asn val gly leu pro gin trp arg cys trp ser pro ala cys 
182/61 212/71 

TCG GTC ACC TGA CGG TCG GGA TGT TCC TTG GGT CTC GGG TTG CTG CTG GGT TTG CTC AAT 
ser val thr) OPA krg ser gly cys ser leu gly leu gly leu leu leu gly leu leu asn 
242/81 SaS>toW>.3<^7 272/91 

GCC CTG CTG GTG CGG CGT TCG GCC GAG TCG ATC ACC GCC AAA GAG CAC CCG TTA AAA CGG 
ala leu leu val arg arg ser ala glu ser ile thr ala lys glu his pro leu lys arg 
302/101 332/111 

TCG ATG GCC CTC AAC TCG GCA TCG CGA CTG GCG ATT ATC ACC ATG CCT CGG GCT GAT C 
ser met ala leu asn ser ala ser arg leu ala ile ile thr met pro arg ala asp) 



SEQ ID fexs.3fcfl -3c*7 



FIGURE 17B 



59/185 



33/11 

jfc> |S> ^ 3UK GGG cca CCG ATC AGT CGA TCG GGT GGT TTC CGC TCC ATC AGC CCG GAA TTG AGG TGC CGC 
n falv pro pro ile ser arg ser gly gly phe arg ser ile ser pro glu leu arg cys arg 
;o> <t>^* 3c ^63/ 2 i 93/31 

AGT GAC GAC ACC AGC GCA GGA CGC GCC GTT GGT GTT TCC CTC TGT TGC TTT CCC GTC CGG 
ser asp asp thr ser ala gly arg ala val gly val ser leu cys cys phe pro val arg 
123/41 153/51 

CTC GCC TTT TTT TCA TCA ACG TTG GAC TGC CGC AGT GGC GAT GTT GGT CGC CGG CGT GTT 
leu ala phe phe ser ser thr leu asp cys arg ser gly asp val gly arg arg arg val 
' 183/61 213/11 

CGG TCA CCT GAC GGT CGG GAT GTT CCT TGG GTC TCG GGT TGC TGC TGG GTT TGC TCA ATG 
arg ser pro asp gly arg asp val pro trp val ser gly cys cys trp val cys ser met 
243/81 273/91 

CCC TGC TGG TGC GGC GTT CGG CCG AGT CGA TCA CCG CCA AAG AGC ACC CGT TM AAC GGT 
pro cys trp cys gly val arg pro ser arg ser pro pro lys ser thr arg)OCH(asn gly 
303/101 333/111 S£® (b *0).370 

CGA TGG CCC TCA ACT CGG CAT CGC GAC TGG CGA TTA TCA CCA TGC CTC GGG CTG ATC 
arg trp pro ser thr arg his arg asp trp arg leu ser pro cys leu gly leu ile) 



SEQ ID >tos. 3^-3-70 
FIGURE 17C 



part of the nucleotide sequence of seq!7A 
1/1 31/11 

Cggc tag aac ccc gaa gga gac etc gcg ggt tgc egg ccc ccg gec cat egg. atg cgt ate 
alv AMB 4sn pro glu gly asp leu ala gly cys arg pro pro ala his arg met arg ile 

^ 91/31 
cig^i^egf^a^tc acg acc gac ata ggg age tac ccc ttg ggt gat tec ggt gcg acg 
arq ser arg arg phe thr thr asp ile gly ser tyr pro leu gly asp ser gly ala thr 
121/41 151/51 

act gcg ata cgc teg gcg ggc cac cga tea gtc gat egg gtg gtt tec get cca tea gee 
thr ala ile arg ser ala gly his arg ser val asp arg val val ser ala pro ser ala 
181/61 211/71 

egg aat tga ggt gee gca gtg acg aca cca gcg cag gac gcg ccg ttg gtg ttt ccc tct 
arg asn^OPA(gly ala ala val thr thr pro ala gin asp ala pro leu val phe pro ser 

gtt'gct tt? c^cc^gtt cgc ctt ttt ttc Ttc'aac gtt gga ctg gec gca gtg gcg atg 
val ala phe arg pro val arg leu phe phe ile asa val gly leu ala ala val ala met 
301/101 331/111 

ttg gtc gee ggc gtg ttc ggt cac ctg acg gtc ggg atg ttc ttg ggt etc ggg ttg ctg 
leu val ala gly val phe gly his leu thr val gly met phe leu gly leu gly leu leu 
361/121 391/131 

ctg ggt ttg etc aat gee ctg ctg gtg egg cgt teg gee gag teg ate acc gec aaa gag 
leu gly leu leu asn ala leu leu val arg arg ser ala glu ser lie thr ala lys glu 
421/141 451/151 

cac ccg tta aaa egg teg atg gee etc aac teg gca teg cga ctg gcg att ate acc ate 
his pro leu lys arg ser met ala leu asn ser ala ser arg leu ala lie lie thr He 
481/161 

etc ggg ctg ate 
leu gly leu ile^ 

SEQ ID 3*71 -3"£? 



FIGURE 17A* 



60/185 



1/1 31/11 
30£> \&\te>*3m gc t aga acc ccg aag gag acc teg egg gtt gec ggc ccc egg ccc ate gga tgc gta tec 
*(ala arg thr pro lys glu thr ser arg val ala gly pro arg pro ile gly cys. val ser 
'C^ 61/21 91/31 

ggt cgc gee gat tea cga ccg aca tag gga gct acc cct tgg gtg att ccg gtg cga cga 

gly arg ala asp ser arg pro thrjAMB (gly ala thr pro trp val ile pro val arg arg 

121/41 s£$> ifclMx^Tfr 153751 

ctg cga tac gct egg egg gee acc gat cag teg ate ggg tgg ttt ccg etc cat cag ccc 
leu arg tyr ala arg arg ala thr asp gin ser ile gly trp phe pro leu his gin pro 
181/61 211/71 

gga att gag gtg ccg cag tga cga cac cag cgc agg acg cgc cgt tgg tgt ttc cct ctg 
gly ile glu val pro gln)OPA(arg his gin arg arg thr arg arg trp cys phe pro leu 
241/81 SdCfctofccST 1 211 ' 91 

ttg ctt tec gtc egg ttc gec ttt ttt tea tea acg ttg gac tgg ccg cag tgg cga tgt 
leu leu ser val arg phe ala phe phe ser ser thr leu asp trp pro gin trp arg cys 
301/101 331/111 

tgg teg ccg gcg tgt teg gtc acc tga egg teg gga tgt tct tgg gtc teg ggt tgc tgc 
trp ser pro ala cys ser val thr)oPA (arg ser gly cys ser trp val ser gly cys cys 
361/121 ^>63i 37*391/131 

tgg gtt tgc tea atg ccc tgc tgg tgc ggc gtt egg ccg agt cga tea ccg cca aag age 
trp val cys ser met pro cys trp cys gly val arg pro ser arg ser pro pro lys ser 
421/141 451/151 

acc cgt taa aac ggt cga tgg ccc tea act egg cat cgc gac tgg cga tta tea cca tec 
thr arg)0CH(asn gly arg trp pro ser thr arg his arg asp trp arg leu ser pro ser 
481/161 l0 ^ 

teg ggc tga tc 
ser gly^OPA 



SEQ ID \^.3W^^ 
FIGURE 17B f 



1/1 31/11 

^»D^x>. 3fcO eta gaa ccc cga agg aga cct cgc ggg ttg ccg gee ccc ggc cca teg gat gcg tat ccg 
(leu glu pro arg arg arg pro arg gly leu pro ala pro gly pro ser asp ala tyr pro 

t<S>lbAo. 3%* N 61/21 91/31 

gtc gcg ccg att cac gac cga cat agg gag eta ccc ctt ggg tga ttc egg tgc gac gac 
val ala pro ile his asp arg his arg glu leu pro leu gly^OPA tone arg cys asp asp 
121/41 151/51 Sc» lb Kf>. 3*1 

tgc gat acg etc ggc ggg cca ccg ate agt cga teg ggt ggt ttc cgc tec ate age ccg 
cys asp thr leu gly gly pro pro ile ser arg ser gly gly phe arg ser ile ser pro 
181/61 211/71 

gaa ttg agg tgc cgc agt gac gac acc age gca gga cgc gec gtt ggt gtt tec etc tgt 
glu leu arg cys arg ser asp asp thr ser ala gly arg ala val gly val ser leu cys 
241/81 271/91 

tgc ttt ccg tec ggt teg cct ttt ttt cat caa cgt tgg act ggc cgc agt ggc gat gtt 
cys phe pro ser gly ser pro phe phe his gin arg trp thr gly arg ser gly asp val 
301/101 331/111 

ggt cgc egg cgt gtt egg tea cct gac ggt egg gat gtt ctt ggg tct egg gtt gct gct 
gly arg arg arg val arg ser pro asp gly arg asp val leu gly ser arg val ala ala 
361/121 391/131 

ggg ttt gct caa tgc cct gct ggt gcg gcg ttc ggc cga gtc gat cac cgc caa aga gca 
gly phe ala gin cys pro ala gly ala ala phe gly arg val asp his arg gin arg ala 
421/141 451/151 

ccc gtt aaa acg gtc gat ggc cct caa etc ggc ate gcg act ggc gat tat cac cat cct 

pro val lys thr val asp gly pro gin leu gly ile ala thr gly asp tyr his his pro 

481/161 

egg gct gat c 

arg ala asp) 

SEQ ID Hos.c3S0-5B£ 



FIGURE 17C 



61/185 



sequence Rvl303 predicted by Cole et al. (Nature 393:537-544) and partially containing 
Seql 
1/1 



Seql7A' 

31/11 



rto it>i^5S^atg acg aca cca gcg cag gac gcg ccg ttg gtg ttt ccc tct gtt get ttc cgt ccg gtt 
«dfmet thr thr pro ala gin asp ala pro leu val phe pro ser val ala phe arg pro val 

:<S> lD* te ' SW 61/21 91/31 

cgc ctt ttt ttc ate aac gtt gga ctg gcc gca gtg gcg atg ttg gtc gcc ggc gtg ttc 
arc leu phe phe ile asn val gly leu ala ala val ala met leu val ala gly val phe 
121/41 151/51 

ggt cac ctg acg gtc ggg atg ttc ttg ggt etc ggg ttg ctg ctg ggt ttg etc aat gcc 
alv his leu thr val gly met phe leu gly leu gly leu leu leu gly leu leu asn ala 
181/61 211/71 
, ctg ctg gtg egg cgt teg gcc gag teg ate acc gcc aaa gag cac ccg tta aaa egg teg 
leu leu val arg arg ser ala.glu ser ile thr ala lys glu his pro leu lys arg ser 
241/81 271/91 

atg gcc etc aac teg gca teg cga ctg gcg att ate acc ate etc ggg ctg ate ate gcc 
met ala leu asn ser ala ser arg leu ala ile ile thr ile leu gly leu ile ile ala 
301/101 331/111 

tac att ttc egg ccc get gga ttg ggc gtc gtg ttc ggg ctg gca ttc ttc cag gtg ctg 
tvr ile phe arg pro ala gly leu gly val val phe gly leu ala phe phe gin val leu 
361/121 391/131 

ctg gtg gca acg acg gcc ctg ccg gtc ctg aag aag ctg cgc act gcg acc gag gaa ccg 
leu val ala thr thr ala leu pro val leu lys lys leu arg thr ala thr glu glu pro 
421/141 451/151 

gtc gca act tat tct tec aat ggc cag acc ggg gga teg gaa gga agg age gcc age gat 
val ala thr tyr ser ser asn gly gin thr gly gly ser glu gly arg ser ala ser asp 
481/161 
gac tga 
asp) OPA 

SEQ ID VU».3S5~33* 
FIGURE 17D 



Orf according to Cole et al. (Nature 393:537-544) and containing Rvl303 

1/1 31/11 
£<S> 3Sfe-tga ggt gcc gca gtg acg aca cca gcg cag gac gcg ccg ttg gtg ttt ccc tct gtt get 

OPAJblv ala ala val thr thr pro ala gin asp ala pro leu val phe pro ser val ala 
i*&*o<3Sfc>-^§J[ 91/31 

ttc cgt ccg gtt cgc ctt ttt ttc ate aac gtt gga ctg gcc gca gtg gcg atg ttg gtc 
phe arg pro val arg leu phe phe ile asn val gly leu ala ala val ala met leu val 
121/41 151/51 

qcc ggc gtg ttc ggt cac ctg acg gtc ggg atg ttc ttg ggt etc ggg ttg ctg ctg ggt 
ala gly val phe gly his leu thr val gly met phe leu gly leu gly leu leu leu gly 
181/61 211/71 

ttg etc aat gcc ctg ctg gtg egg cgt teg gcc gag teg ate acc gcc aaa gag cac ccg 
leu leu asn ala leu leu val arg arg ser ala glu ser ile thr ala lys glu his pro 
241/81 271/91 

tta aaa egg teg atg gcc etc aac teg gca teg cga ctg gcg att ate acc ate etc ggg 
leu lys arg ser met ala leu asn ser ala ser arg leu ala ile ile thr ile leu gly 
301/101 331/111 

eta ate ate gcc tac att ttc egg ccc get gga ttg ggc gtc gtg ttc ggg ctg gca ttc 
leu ile ile ala tyr ile phe arg pro ala gly leu gly val val phe gly leu ala phe 
361/121 391/131 

ttc cag gtg ctg ctg gtg gca acg acg gcc ctg ccg gtc ctg aag aag ctg cgc act gcg 
phe gin val leu leu val ala thr thr ala leu pro val leu lys lys leu arg thr ala 
421/141 451/151 

acc gag gaa ccg gtc gca act tat tct tec aat ggc cag acc ggg gga teg gaa gga agg 
thr glu glu pro val ala thr tyr ser ser asn gly gin thr gly gly ser glu gly arg 
481/161 



age gcc age gat gac tga 
ser ala ser asp asp)OPA 



SEQ ID Kx*. 3BS- 3*^ 



FIGURE 17^ 



62/185 



31/11 

£(S> »t> GTC GAA CAG GTA CGG AAG GCG CCG TCG GTC GCT CGG TCC GCT GGT ATC TCG TGT TCA GCC 

, /val olu qln val arg lys ala pro ser val ala arg ser ala gly ile ser cys ser ala 

IttiOitcS*? V 61/2 i 91/31 

AGC CAG CGG CCG TTA ACG TGG CCG AAC AGG TCG TCT TGG GGT CGG GCA TCA GCG TCG ATG 
ser gin arg pro leu thr trp pro asn arg ser ser trp gly arg ala ser ala ser met 
121/41 151/51 

TGG CTC AGG TCG ATA CCC GAG GGG ATG GCA AGT GTC ACC CCG CCA TCC TTC CAC CTC TTT 
trp leu arg ser ile pro glu gly met ala ser val thr pro pro ser phe his leu phe 
181/61 211/71 

TCG GGT GCA ACG ATC GGG CCA TGC CTG ACG GGG AGC AGA GCC AGC CAC CGG CCC AAG AAG 
ser gly ala thr ile gly pro cys leu thr gly ser arg ala ser his arg pro lys lys 
241/81 271/91 

ATG CGG AAG ACG ACT CGC GGC CCG ACG CCG CGG AGG CCG CCG CGG CCG AAC CCA AAT CAT 
met arg lys thr thr arg gly pro thr pro arg arg pro pro. arg pro asn pro asn his 
301/101 331/111 

CAG CCG GTC CCG ATG TTC TCG ACC TAC GGT ATC GCC TCG ACA CTA CTC GGC GTG CTA TCG 
gin pro val pro met phe ser thr tyr gly ile ala ser thr leu leu gly val leu ser 
361/121 

GTC GCC GCG GTC GTG CTG GGT GCG ATG ATC 
val ala ala val val leu gly ala met ile) 



SEQ ID 

FIGURE 18A 



32/11 

£<Si lb A* 3&} T CG AAC AGG TAC GGA AGG CGC CGT CGG TCG CTC GGT CCG CTG GTA TCT CGT GTT CAG CCA 
^ i >d/s (ser asn arg tyr gly arg arg arg arg ser leu gly pro leu val ser arg val gin pro 

^ iO lixv-SYO \ 2/21 92/31 

GCC AGC GGC CGT TAA CGT GGC CGA ACA GGT CGT CTT GGG GTC GGG CAT CAG CGT CGA TGT 
ala ser gly arg)oCH(arg gly arg thr gly arg leu gly val gly his gin arg arg cys 
122/41 SZ& Il> *U>.3<*/ 152/51 

GGC TCA GGT CGA TAC CCG AGG GGA TGG CAA GTG TCA CCC CGC CAT CCT TCC ACC TCT TTT 
gly ser gly arg tyr pro arg gly trp gin val ser pro arg his pro ser thr ser phe 
182/61 212/71 

CGG GTG CAA CGA TCG GGC CAT GCC TGA CGG GGA GCA GAG CCA GCC ACC GGC CCA AGA AGA 
arg val gin arg ser gly his ala)oPA(arg gly ala glu pro ala thr gly pro arg arg 
242/81 S£S> 332.272/91 

TGC GGA AGA CGA CTC GCG GCC CGA CGC CGC GGA GGC CGC CGC GGC CGA ACC CAA ATC ATC 
cys gly arg arg leu ala ala arg arg arg gly gly arg arg gly arg thr gin ile lie 
302/101 332/111 

AGC CGG TCC CGA TGT TCT CGA CCT ACG GTA TCG CCT CGA CAC TAC TCG GCG TGC TAT CGG 
ser arg ser arg cys ser arg pro thr val ser pro arg his tyr ser ala cys tyr arg 
362/121 

TCG CCG CGG TCG TGC TGG GTG CGA TGA TC 
ser pro arg ser cys trp val arg] OPA 



SEQ ID Kcc, 
FIGURE 18B 



63/185 



3/1 33/11 
5£> »V>^-*3H CGA ACA GGT ACG GAA GGC GCC GTC GGT CGC TCG GTC CGC TGG TAT CTC GTG TTC AGC CAG 
as> »e> / arq tnr gly thr glu gly ala val gly arg ser val arg trp tyr leu val phe ser gin 

63/21 93/31 

CCA GCG GCC GTT AAC GTG GCC GAA CAG GTC GTC TTG GGG TCG GGC ATC AGC GTC GAT GTG 
pro ala ala val asn val ala glu gin val val leu gly ser gly ile ser val asp val 
123/41 153/51 

GCT CAG GTC GAT ACC CGA GGG GAT GGC AAG TGT CAC CCC GCC ATC CTT CCA CCT CTT TTC 
• ala gin val asp thr arg gly asp gly lys cys his pro ala ile leu pro pro leu phe 
183/61 213/71 

GGG TGC AAC GAT CGG GCC ATG CCT GAC GGG GAG CAG AGC CAG CCA CCG GCC CAA GAA GAT 
qly cys asn asp arg ala met pro asp gly glu gin ser gin pro pro ala gin glu asp 
243/81 273/91 

GCG GAA GAC GAC TCG CGG CCC GAC GCC GCG GAG GCC GCC GCG GCC GAA CCC AAA TCA TCA 
ala glu asp asp ser arg pro asp ala ala glu ala ala ala, ala glu pro lys ser ser 
303/101 333/111 

GCC GGT CCC GAT GTT CTC GAC CTA CGG TAT CGC CTC GAC ACT ACT CGG CGT GCT ATC GGT 
ala gly pro asp val leu asp leu arg tyr arg leu asp thr thr arg arg ala ile gly 
363/121 

CGC CGC GGT CGT GCT GGG TGC GAT GAT C 
arg arg gly arg ala gly cys asp asp) 

SEQ ID tu&-S<i&-&* < 1 
FIGURE 18C 



part of the nucleotide sequence of seq!8A 

fn ' 31/11 

cTfl> lOAJ> 3% . GAA GGC GCC GTC GGT CGC TCG GTC CGC TGG TAT CTC GTG TTC AGC CAG CCA GCG GCC GTT 
c&l&tic. 3<3<* folu gly ala val gly arg ser val arg trp tyr leu val phe ser gin pro ala ala val 

61/21 91/31 

AAC GTG GCC GAA CAG GTC GTC TTG GGG TCG GGC ATC AGC GTC GAT GTG GCT CAG GTC GAT 
asn val ala glu gin val val leu gly ser gly ile ser val asp val ala gin val asp 

ACC 7 CGA GGG GAT GGC AAG TGT CAC CCC GCC ATC CTT CCA CCT CTT TTC GGG TGC AAC GAT 
thr arg gly asp gly lys cys his pro ala ile leu pro pro leu phe gly cys asn asp 
181/61 211/71 

CGG GCC ATG CCT GAC GGG GAG CAG AGC CAG CCA CCG GCC CAA GAA GAT GCG GAA GAC GAC 
arg ala met pro asp gly glu gin ser gin pro pro ala gin glu asp ala glu asp asp 
241/81 271/91 

TCG CGG CCC GAC GCC GCG GAG GCC GCC GCG GCC GAA CCC AAA TCA TCA GCC GGT CCG ATG 
ser arg pro asp ala ala glu ala ala ala ala glu pro lys ser ser ala gly pro met 
301/101 331/111 

TTC TCG ACC TAC GGT ATC GCC TCG ACA CTA CTC GGC GTG CTA TCG GTC GCC GCG GTC GTG 
phe ser thr tyr gly ile ala ser thr leu leu gly val leu ser val ala ala val val 
361/121 

CTG GGT GCG ATG ATC 
leu gly ala met ile} 



SEQ ID Xce>. 3^5- mt? 



FIGURE 18A f 



64/185 



1/1 

i& \y^> iAuD.3<V?CGG AAG GCG CCG TCG GTC GCT CGG TCC 
& t^Ko. ( ar ? lys ala pr ° Ser Val ala arg Ser 

TTA ACG TGG CCG AAC AGG TCG TCT TGG 
leu thr trp pro asn arg ser ser trp 
121/41 

ATA CCC GAG GGG ATG GCA AGT GTC ACC 
ile pro glu gly met ala ser val thr 
181/61 

ATC GGG CCA TGC CTG ACG GGG AGC AGA 
ile gly pro cys leu thr gly ser arg 
241/81 

ACT CGC GGC CCG ACG CCG CGG AGG CCG 
thr arg gly pro thr pro arg arg pro 
301/101 

TGT TCT CGA CCT ACG GTA TCG CCT CGA 
cys ser arg pro thr val ser pro arg 
361/121 

TGC TGG GTG CGA TGA TC 
cys trp val arg^OPA 



31/11 

GCT GGT ATC TCG TGT TCA GCC AGC CAG CGG CCG 
ala gly ile ser cys ser ala ser gin arg pro 
91/31 

GGT CGG GCA TCA GCG TCG ATG TGG CTC AGG TCG 
gly arg ala ser ala ser met trp leu arg ser 
151/51 

CCG CCA TCC TTC CAC CTC TTT TCG GGT GCA ACG 
pro pro ser phe his leu phe ser gly ala thr 
211/71 

GCC AGC CAC CGG CCC AAG AAG ATG CGG AAG ACG 
ala ser his arg pro lys lys met arg lys thr 
271/91 

CCG CGG CCG AAC CCA AAT CAT CAG CCG GTC CGA 
pro arg pro asn pro asn his gin pro val arg 
331/111 

CAC TAC TCG GCG TGC TAT CGG TCG CCG CGG TCG 
his tyr ser ala cys tyr arg ser pro arg ser 



SEQ ID ^.50 7-3% 
FIGURE 18B' 



1/1 31/11 



;£<$> i£>fo'3Q9 GGA AGG 


CGC 


CGT 


CGG 


TCG 


CTC 


GGT 


CCG 


CTG 


GTA TCT 


CGT 


GTT 


CAG 


CCA 


GCC 


AGC 


GGC 


CGT 




arg 


arg 


arg 


ser 


leu 


gly 


pro 


leu 


val ser 


arg 


val 


gin 


pro 


ala 


ser 


gly 




61/21 


















91/31 


















TAA CGT 


GGC 


CGA 


ACA 


GGT 


CGT 


CTT 


GGG 


GTC 


GGG CAT 


CAG 


CGT 


CGA 


TGT 


GGC 


TCA 


GGT 


CGA 


CX£> »C> Ko.QOt J2CS*(arg 


gly 


arg 


thr 


gly 


arg 


leu 


gly 


val 


gly his 


gin 


arg 


arg 


cys 


gly 


ser 


gly 


arg 


121/41 


















151/51 


















TAC CCG 


AGG 


GGA 


TGG 


CAA 


GTG 


TCA 


CCC 


CGC 


CAT CCT 


TCC 


ACC 


TCT 


TTT 


CGG 


GTG 


CAA 


CGA 


tyr pro 


arg 


gly 


trp 


gin 


val 


ser 


pro 


arg 


his pro 


ser 


thr 


ser 


phe 


arg 


val 


gin 


arg 


181/61 


















211/71 


















TCG GGC 


CAT 


GCC 


TGA 


CGG 


GGA 


GCA 


GAG 


CCA 


GCC ACC 


GGC 


CCA 


AGA 


AGA 


TGC 


GGA 


AGA 


CGA 


ser gly 


his 


ala) 
< 


OPA 


/arg 


gly 


ala 


glu 


pro 


ala thr 


gly 


pro 


arg 


arg 


cys 


gly 


arg 


arg 


241/81 










271/91 


















CTC GCG 


GCC 


CGA* 


CGC 


CGC 


GGA 


GGC 


CGC 


CGC 


GGC CGA 


ACC 


CAA 


ATC 


ATC 


AGC 


CGG 


TCC 


GAT 


leu ala 


ala 


arg 


arg 


arg 


gly 


gly 


arg 


arg 


gly arg 


thr 


gin 


ile 


ile 


ser 


arg 


ser 


asp 


301/101 


















331/111 
















CGT 


GTT CTC 


GAC 


CTA 


CGG 


TAT 


CGC 


CTC 


GAC 


ACT 


ACT CGG 


CGT 


GCT 


ATC 


GGT 


CGC 


CGC 


GGT 


val leu 


asp 


leu 


arg 


tyr 


arg 


leu 


asp 


thr 


thr arg 


arg 


ala 


ile 


gly 


arg 


arg 


gly 


arg 



361/121 

GCT GGG TGC GAT GAT C 
ala gly cys asp asp) 

SEQ ID loos, 3WsM0Z. 



FIGURE 18C 
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sequence Rv0199 predicted by Cole 
seql8A' 

1/1 

S&iDfto. <-K>3 atg cct gac ggg gag cag age cag cca 

ir^<*w( Met P ro asp gly glu gln ser gln pro 
' 61/21 

ccc gac gec gcg gag gec gec gcg gec 
. pro asp ala ala glu ala ala ala ala 
121/41 

acc tac ggt ate gec teg aca eta etc 
thr tyr gly ile ala ser thr leu leu 
181/61 

gcg atg ate tgg tec gca cac cgc gat 
ala met ile trp ser ala his arg asp 
241/81 

atg ctg acc gee get gaa tgg acg gee 
met leu thr ala ala glu trp thr ala 
301/101 

gec age ctg cag cga ctg cac gac gga 
. ala ser leu gln arg leu his asp gly 
361/121 

gtc gtg cag ccc tac egg cag gtg gtg 
val val gln pro tyr arg gln val val 
421/141 

gag gcg gta gcg ate gat acg gtg cac 
glu ala val ala ile asp thr val his 
481/161 

ccg gta gta acc acg aaa ttg cca ccg 
pro val val thr thr lys leu pro pro 
541/181 

gcg acg teg gtc agt gag aac gec ggc 
ala thr ser val ser glu asn ala gly 
601/201 

etc gat gtc tec gat gtg gac ggc aag 
leu asp val ser asp val asp gly lys 



et al. (Nature 393:537-544) and containing 



31/11 

ccg gee caa gaa gat gcg gaa gac gac teg egg 
pro ala gln glu asp ala glu asp asp ser arg 
91/31 

gaa ccc aaa tea tea gee ggt ccg atg ttc teg 
glu pro lys ser ser ala gly pro met phe ser 
151/51 

ggc gtg eta teg gtc gee gcg gtc gtg ctg ggt 
gly val leu ser val ala ala val val leu' gly 
211/71 

gac tec ggc gag cgt acc tac ctg acc egg gtc 
asp ser gly glu arg thr tyr leu thr arg val 
271/91 

gtg ctg ate aac atg aac gee gac aac ate gat 
val leu ile asn met asn ala asp asn ile asp 
331/111 

acg gtc ggt caa etc aac acc gac ttc gac get 
thr val gly gln leu asn thr asp phe asp ala 
391/131 

gag aag ttg egg acg cac age age ggc agg ate 
glu lys leu arg thr his ser ser gly arg ile 

451/151' ' 
cgc gag ctg gat acc cag tec ggt gec gee cga 
arg glu leu asp thr gln ser gly ala ala arg 

511/171 

ttt gec act cgc acc gac teg gtg ctg ctg gtc 
phe ala thr arg thr asp ser val leu leu val 
571/191 

gee aaa ccc cag acc gtg cac tgg aac ttg egg 
ala lys pro gln thr val his trp asn leu arg 
631/211 

ctg atg ate tec egg ttg gag teg att cga tga 
leu met ile ser arg leu glu ser ile arg^OPA 



SEQ ID Xjos. tjO3-^0V 
FIGURE 18D 



66/185 



ORF according to Cole et al. (Nature 393:537-544) and containing Rv0199 

1/x 31/11 
CS> OAo.^/o&taa tec gat gec gga ttg ggt gaa atg cac caa gta acg ggt cga gtc ttt gga ate ggt 

OCH (ser asp ala gly leu gly glu met his gin val thr gly arg val phe gly ile gly 
;<$> vr>W^. Wfe 61/21 91/31 

ate gac ata gac tec gat gec gee gee cac gec ggc acg ttg cag agt gee aag ggc ggc 

ile asp ile asp ser asp ala ala ala his ala gly thr leu gin ser ala lys gly gly 

121/41 151/51 

ggc caa ttc ggt ggc gtc ggc cgc get gtc aat cgt ggc caa ttc gtc gtg cag egg ttg 
qly gin phe gly gly val gly arg ala val asn arg gly gin phe val val gin arg leu 
181/61 211/71 

cac ccc tgc gcg etc gac ggc ttc etc gtc gag gaa get ggc gta gag gtc gec gat gcg 
his pro cys ala leu asp gly phe leu val glu glu ala gly val glu val ala asp ala 
241/81 271/91 

ctg cgc ate ggt gec tac cgc age acc tgc ttg get ggc ctg gat gat cag gtc teg cac 
leu arg ile gly ala tyr arg ser thr cys leu ala gly leu asp asp gin val ser his 
301/101 331/111 

ttg tgt etc ggc gcg gtc gaa cag get acg gaa ggc gee gtc ggt cgc teg gtc cgc tgg 
leu cys leu gly ala val glu gin ala thr glu gly ala val gly arg ser val arg trp 
361/121 391/131 

tat etc gtg ttc age cag cca gcg gee gtt aac gtg gee gaa cag gtc gtc ttg ggg teg 
tyr leu val phe ser gin pro ala ala val asn val ala glu gin val val leu gly ser 
421/141 451/151 

ggc ate age gtc gat gtg get cag gtc gat acc cga ggg gat ggc aag tgt cac ccc gee - 
qly ile ser val asp val ala gin val asp thr arg gly asp gly lys cys his pro ala 
481/161 511/171 

ate ctt cca cct ctt ttc ggg tgc aac gat egg gec atg cct gac ggg gag cag age cag 
• ile leu pro pro leu phe gly cys asn asp arg ala met pro asp gly glu gin , ser gin ,, 

541/181 571/191 

cca ccg gec caa gaa gat gcg gaa gac gac teg egg ccc gac gec gcg gag gec gec gcg 
pro pro ala gin glu asp ala glu asp asp ser arg pro asp ala ala glu ala ala ala 
601/201 631/211 

gee gaa ccc aaa tea tea gee ggt ccg atg ttc teg acc tac ggt ate gec teg aca eta 
ala glu pro lys ser ser ala gly pro met phe ser thr tyr gly ile ala ser thr leu 
661/221 691/231 

etc ggc gtg eta teg gtc gee gcg gtc gtg ctg ggt gcg atg. ate tgg tec gca cac cgc 
leu gly val leu ser val ala ala val val leu gly ala met ile trp ser ala his arg 
721/241 751/251 

gat gac tec ggc gag cgt acc tac ctg acc egg gtc atg ctg acc gee get gaa tgg acg 
asp asp ser gly glu arg thr tyr leu thr arg val met leu thr ala ala glu trp thr 
781/261 811/271 

gee gtg ctg ate aac atg aac gee gac aac ate gat gec age ctg cag cga ctg cac gac 
ala val leu ile asn met asn ala asp asn ile asp ala ser leu gin arg leu his asp 
841/281 871/291 

gga acg gtc ggt caa etc aac acc gac ttc gac get gtc gtg cag ccc tac egg cag gtg 
qly thr val gly gin leu asn thr asp phe asp ala val val gin pro tyr arg gin val 
901/301 931/311 

gtg gag aag ttg egg acg cac age age ggc agg ate gag gcg gta gcg ate gat acg gtg 
val glu lys leu arg thr his ser ser gly arg ile glu ala val ala ile asp thr val 
961/321 991/331 

cac cgc gag ctg gat acc cag tec ggt gee gee cga ccg gta gta acc acg aaa ttg cca 
his arg glu leu asp thr gin ser gly ala ala arg pro val val thr thr lys leu pro 
1021/341 1051/351 

ccg ttt gec act cgc acc gac teg gtg ctg ctg gtc gcg acg teg gtc agt gag aac gec 
pro phe ala thr arg thr asp ser val leu leu val ala thr ser val ser glu asn ala 
1081/361 1111/371 

ggc gee aaa ccc cag acc gtg cac tgg aac ttg egg etc gat gtc tec gat gtg gac ggc 
qlv ala lys pro gin thr val his trp asn leu arg leu asp val ser asp val asp gly 
1141/381 1171/391 
aag ctg atg ate tec egg ttg gag teg att cga tga 
lys leu met ile ser arg leu glu ser ile arg) OPA 

SEQ ID |/Vo*>. <-/06-<VC^ 



FIGURE 18E 



67/185 



31/11 

3S> iO lifi. W| GTT GCG CAA CGG GGT GAG CAC CGA CGC GAT GAT GGC GCA ACT ATC GAA ACT GCA GGA CAT 
(val ala qln arg gly glu his arg arg asp asp gly ala thr ile glu thr ala gly his 

^ lDgu> - W 61/21 91/31 

CGC CAA CGC CAA CGA CGG CAC TCG CGC GGT GGG CAC CCC TGG CTA TCA GGC CAG CGT CGA 
arg gin arg gin arg arg his ser arg gly gly his pro trp leu ser gly gin arg arg 
121/41 151/51 

CTA TGT GGT AAA CAC ACT GCG CAA CAG CGG TTT TGA TGT GCA AAC CCC GGA GTT CTC CGC 
leu cys gly lys his thr ala gin gin arg phe^OPACpys ala asn pro gly val leu arg 
■181/61 211/71 -sec&vb^.uo^ 

TCG CGT GTT CAA GGC CGA AAA AGG GGT GGT GAC CCT CGG CGG CAA CAC CGT GGA GGC GAG 
ser arg val gin gly arg lys arg gly gly asp pro arg arg gin his arg gly gly glu 
241/81 271/91 

GGC GCT CGA GTA CAG CCT CGG CAC ACC GCC GGA CGG GGT GAC GGG CCC GCT GGT GGC TGC 
gly ala arg val gin pro arg his thr ala gly arg gly asp gly pro ala gly gly cys 
301/101 331/111 

CCC CGC CGA CGA CAG TCC GGG CTG CAG TCC GTC GGA CTA CGA CAG GCT GCC GGT GTC CGG 
pro arg arg arg gin ser gly leu gin ser val gly leu arg gin ala ala gly val arg 
361/121 

TGC GGT GGT GCT GGT AGA TC 
cys gly gly ala gly arg) 

SEQ ID Mjss qov Hc^ 
FIGURE 19A 



32/11 

\Z> *\o. Cuo TTG CGC AAC GGG GTG AGC ACC GAC GCG ATG ATG GCG CAA CTA TCG AAA CTG CAG GAC ATC 
CfcfoCU Cfi» (leu arg asn gly val ser thr asp ala met met ala gin leu ser lys leu gin asp ile 
V 62/21 92/31 

GCC AAC GCC AAC GAC GGC ACT CGC GCG GTG GGC ACC CCT GGC TAT CAG GCC AGC GTC GAC 
ala asn ala asn asp gly thr arg ala val gly thr pro gly tyr gin ala ser val asp 
122/41 152/51 

TAT GTG GTA AAC ACA CTG CGC AAC AGC GGT TTT GAT GTG CAA ACC CCG GAG TTC TCC GCT 
tyr val val asn thr leu arg asn ser gly phe asp val gin thr pro glu phe ser ala 
182/61 212/71 

CGC GTG TTC AAG GCC GAA AAA GGG GTG GTG ACC CTC GGC GGC AAC ACC GTG GAG GCG AGG 
arg val phe lys ala glu lys gly val val thr leu gly gly asn thr val glu ala arg 
242/81 272/91 

GCG CTC GAG TAG AGC CTC GGC ACA CCG CCG GAC GGG GTG ACG GGC CCG CTG GTG GCT GCC 
ala leu glu tyr ser leu gly thr pro pro asp gly val thr gly pro leu val ala ala 
302/101 332/111 

CCC GCC. GAC GAC AGT CCG GGC TGC AGT CCG TCG GAC TAC GAC AGG CTG CCG GTG TCC GGT 
pro ala asp asp ser pro gly cys ser pro ser asp tyr asp arg leu pro val ser gly 
362/121 

GCG GTG GTG CTG GTA GAT C 
ala val val leu val asp) 



SEQ ID IKC&.QtO -Htl 
FIGURE 19B 
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33/11 



5& 1^*0. ^Z-tgc GCA 


ACG 


GGG 


TGA 


GCA 


CCG 


ACG 


CGA TGA TGG CGC 


AAC 


TAT 


CGA 


AAC 


TGC 


AGG 


ACA 


TCG 


r<s> it> W/3( c y s ala 

63/21 


thr 


giy) 


OPA 


(ala 


pro 


thr 


arg) OPA (trp arg 


asn 


tyr 


arg 


asn 


cys 


arg 


thr 


ser 








-SC&»d«**593/31 


















CCA ACG 


CCA 


ACG 


ACG 


GCA 


CTC 


GCG 


CGG TGG GCA CCC 


CTG 


GCT 


ATC 


AGG 


CCA 


GCG 


TCG 


ACT 


pro thr 
123/41 


pro 


thr 


thr 


ala 


leu 


ala 


arg trp ala pro 


leu 


ala 


ile 


arg 


pro 


ala 


ser 


thr 












153/51 


















ATG TGG 


TAA 


ACA 


CAC 


TGC 


GCA 


ACA 


GCG GTT TTG ATG 


TGC 


AAA 


CCC 


CGG 


AGT 


TCT 


CCG 


CTC 


met trpYoCH 


(thr 


his 


cys 


ala 


thr 


ala val leu met 


cys 


lys 


pro 


arg 


ser 


ser 


pro 


leu 


183/61 








213/71 


















GCG TGT 


TCA 


AGG 


CCG 


AAA 


AAG 


GGG 


TGG TGA CCC TCG 


GCG 


GCA 


ACA 


CCG 


TGG 


AGG 


CGA 


GGG 


ala cys 


ser 


arg 


pro 


lys 


lys gly trp)OPA(pro ser 


ala 


ala 


thr 


pro 


trp 


arg 


arg 


gly 


243/81 












£>££>io 73/91 


















CGC TCG 


AGT 


ACA 


GCC 


TCG 


GCA 


CAC 


CGC CGG ACG GGG 


TGA 


CGG 


GCC 


CGC 


TGG 


TGG 


CTG 


CCC 


arg ser 
303/101 


ser 


thr 


ala 


ser 


ala 


his 


arg arg thr gly) OPA 


(arg 


ala 


arg 


trp 


trp 


leu 


pro 














333/111 












CCG CCG 


ACG 


ACA 


GTC 


CGG 


GCT 


GCA 


GTC CGT CGG ACT 


ACG 


ACA 


GGC 


TGC 


CGG 


TGT 


CCG 


GTG 


pro pro 


thr 


thr 


val 


arg 


ala 


ala 


val arg arg thr 


thr 


thr 


gly 


cys 


arg 


cys 


pro 


val 



363/121 

CGG TGG TGC TGG TAG ATC 
arg trp cys trp)AMB ile 

SEQ ID KPa. 
FIGURE 19C 



part of the nucleotide sequence of seq!9A 
1/1 31/11 



^OmC)*^*^'* CTA TCG 


AAA 


CTG 


CAG 


GAC 


ATC 


GCC 


AAC 


GCC 


AAC GAC 


GGC 


ACT 


CGC 


GCG 


GTG 


GGC 


ACC 


CCT 


£® \C^U£>* HiP(leu ser 
61/21 


lys 


leu 


gin 


asp 


ile 


ala 


asn 


ala 


asn asp 


gly 


thr 


arg 


ala 


val 


gly 


thr 


pro 
















91/31 


















GGC TAT 


CAG 


GCC 


AGC 


GTC 


GAC 


TAT 


GTG 


GTA 


AAC ACA 


CTG 


CGC 


AAC 


AGC 


GGT 


TTT 


GAT 


GTG 


gly tyr 


gin 


ala 


ser 


val 


asp 


tyr 


val 


val 


asn thr 


leu 


arg 


asn 


ser 


gly 


phe 


asp 


val 


121/41 


















151/51 
















GGC 


CAA ACC 


CCG 


GAG 


TTC 


TCC 


GCT 


CGC 


GTG 


TTC 


AAG GCC 


GAA 


AAA 


GGG 


GTG 


GTG 


ACC 


CTC 


gin thr 


pro 


glu 


phe 


ser 


ala 


arg 


val 


phe 


lys ala 


glu 


lys 


gly 


val 


val 


thr 


leu 


gly 


181/61 
















211/71 
















GTG 


GGC AAC 


ACC 


GTG 


GAG 


GCG 


AGG 


GCG 


CTC 


GAG 


TAC AGC 


CTC 


GGC 


ACA 


CCG 


CCG 


GAC 


GGG 


gly asn 


thr 


val 


glu 


ala 


arg 


ala 


leu 


glu 


tyr ser 


leu 


gly 


thr 


pro 


pro 


asp 


gly 


val 


241/81 


















271/91 














GAC 


TAC 


ACG GGC 


CCG 


CTG 


GTG 


GCT 


GCC 


CCC 


GCC 


GAC 


GAC AGT 


CCG 


GGC 


TGC 


AGT 


CCG 


TCG 


thr gly 


pro 


leu 


val 


ala 


ala 


pro 


ala 


asp 


asp ser 


pro 


gly 


cys 


ser 


pro 


ser 


asp 


tyr 


301/101 
















331/111 


















GAC AGG 


CTG 


CCG 


GTG 


TCC 


GGT 


GCG 


GTG 


GTG 


CTG GTA 


GAT 


C 














asp arg 


leu 


pro 


val 


ser 


gly 


ala 


val 


val 


leu val 


asp; 















SEQ ID 

FIGURE 19A 



69/185 



1/1 


















31/11 


















'-TAT CGA 


AAC 


TGC 


AGG 


ACA 


TCG 


CCA 


ACG 


CCA 


ACG ACG 


GCA 


CTC 


GCG 


CGG 


TGG 


GCA 


CCC 


CTG 


^tyr arg 
61/21 


asn 


cys 


arg 


thr 


ser 


pro 


thr 


pro 


thr thr 


ala 


leu 


ala 


arg 


trp 


ala 


pro 


leu 


















91/31 


















GCT ATC 


AGG 


CCA 


GCG 


TCG 


ACT 


ATG 


TGG 


TAA 


ACA CAC 


TGC 


GCA 


ACA 


GCG 


GTT 


TTG 


ATG 


TGC 


ala ile 


arg 


pro 


ala 


ser 


thr 


met 


trp) 


OCH 


(thr his 


cys 


ala 


thr 


ala 


val 


leu 


met 


cys 


121/41 












S6S> ifcK6.V23l51/51 


















AAA CCC 


CGG 


AGT 


TCT 


CCG 


CTC 


GCG 


TGT 


TCA 


AGG CCG 


AAA 


AAG 


GGG 


TGG 


TGA 


CCC 


TCG 


GCG 


- lys pro 


arg 


ser 


ser 


pro 


leu 


ala 


cys 


ser 


arg pro 


lys lys gly trp) 


OPA| 




ser 


ala 


181/61 


















211/71 














GCA ACA 


CCG 


TGG 


AGG 


CGA 


GGG 


CGC 


TCG 


AGT 


ACA GCC 


TCG 


GCA 


CAC 


CGC 


CGG 


ACG 


GGG 


TGA 


ala thr 


pro 


trp 


arg 


arg 


gly 


arg 


ser 


ser 


thr ala 


ser 


ala 


his 


arg 


arg 


thr 


giy) 


OPA 


241/81 


















271/91 


















CGG GCC 


CGC 


TGG 


TGG 


CTG 


CCC 


CCG 


CCG 


ACG 


ACA GTC 


CGG 


GCT 


GCA 


GTC 


CGT 


CGG 


ACT 


ACG 


(arg ala 


arg 


trp 


trp 


leu 


pro 


pro 


pro 


thr 


thr val 


arg 


ala 


ala 


val 


arg 


arg 


thr 


thr 


3 301/101 


















331/111 


















ACA GGC 


TGC 


CGG 


TGT 


CCG 


GTG 


CGG 


TGG 


TGC 


TGG TAG 


ATC 
















thr gly 


cys 


arg 


cys 


pro 


val 


arg 


trp 


cys 


trp) AMB 


ile 

















SEQ ID %ce>. H2i- ^2£> 
FIGURE 19B f 



1/1 

l& ivD ita. WZ<e ATC GAA ACT GCA GGA CAT CGC CAA CGC 

£>Ko.Q27 ^ ie glu thr ala gly his arg gin arg 

CTA TCA GGC CAG CGT CGA CTA TGT GGT 
leu ser gly gin arg arg leu cys gly 
121/41 

AAC CCC GGA GTT CTC CGC TCG CGT GTT 
asn pro gly val leu arg ser arg val 
181/61 

CAA CAC CGT GGA GGC GAG GGC GCT CGA 
gin his arg gly gly glu gly ala arg 
241/81 

GGG CCC GCT GGT GGC TGC CCC CGC CGA 
gly pro ala gly gly cys pro arg arg 
301/101 

CAG GCT GCC GGT GTC CGG TGC GGT GGT 
gin ala ala gly val arg cys gly gly 



31/11 

CAA CGA CGG CAC TCG CGC GGT GGG CAC CCC TGG 
gin arg arg his ser arg gly gly his pro trp 
91/31 

AAA CAC ACT GCG CAA CAG CGG TTT TGA TGT GCA 
lys his thr ala gin gin arg phe) OPA/cys ala 

151/51 SC& ■O to* 

CAA GGC CGA AAA AGG GGT GGT GAC CCT CGG CGG 
gin gly arg lys arg gly gly asp pro arg arg 

211/71 

GTA CAG CCT CGG CAC ACC GCC GGA CGG GGT GAC 
val gin pro arg his thr ala gly arg gly asp 
271/91 

CGA CAG TCC GGG CTG CAG TCC GTC GGA CTA CGA 
arg gin ser gly leu gin ser val gly leu arg 

331/111 
GCT GGT AGA TC 
ala gly arg) 



SEQ ID Kc*>. q£(t**t2g' 
FIGURE 19C' 
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sequence Rv0418 predicted by Cole 
seql9A' 

1/1 

r<&> lt>AD.V^atg gtg aac aaa tec agg atg atg ccg 
SiQlteMSCWet val asn lys ser arg met met pro 
61/21 

ctg acg acg ggc tgt ate egg tgg tct 
leu thr thr gly cys ile arg trp ser 
121/41 

gee gca gag ttc gee gtt gcg ttg cgc 
ala ala glu phe ala val ala leu arg 
181/61 

eta teg aaa ctg cag gac ate gee aac 
leu ser lys leu gin asp ile ala asn 
241/81 

ggc tat cag gec age gtc gac tat gtg 
gly tyr gin ala ser val asp tyr val 
301/101 

caa acc ccg gag ttc tec get cgc gtg 
gin thr pro glu phe ser ala arg val 
361/121 

ggc aac acc gtg gag gcg agg gcg etc 
gly asn thr val glu ala arg ala leu 
421/141 

acg ggc ccg ctg gtg get gee ccc gee 
thr gly pro leu val ala ala pro ala 
481/161 

gac agg ctg ccg gtg tec ggt gcg gtg 
asp arg leu pro val ser gly ala val 
541/181 

cag aag gaa gac gca gee gcg cag cgc 
gin lys glu asp ala ala ala gin arg 
601/201 

gac gag cag gcg atg ggc ggc acc ctg 
asp glu gin ala met gly gly thr leu 
661/221 

agt gtc acc aag teg gtc gga ttc cag 
ser val thr lys ser val gly phe gin 
721/241 

etc acg gcg age acc caa agt ttc aag 
leu thr ala ser thr gin ser phe lys 
781/261 

teg teg gee aac gtg gtg atg gca ggt 
ser ser ala asn val val met ala gly 
841/281 

ate aac gac aac ggc teg gga gtg get 
ile asn asp asn gly ser gly val ala 
901/301 

tea ccg cat gtg tec aac gcg gta egg 
ser pro his val ser asn ala val arg 
961/321 

att ggg tea cga aac tac gtc gag teg 
ile gly ser arg asn tyr val glu ser 



et al. (Nature 393:537-544) and containing 



31/11 

gcg gtg ctg gee gtg get gtg gtc gtc gca ttc 
ala val leu ala val ala val val vai ala phe 
91/31 

acg cag teg egg ccc gtt gtt aac ggc ccc get 
thr gin ser arg pro val val asn gly pro ala 
151/51 

aac egg gtg age acc gac gcg atg atg gcg cac 
asn arg val ser thr asp ala met met ala his 
211/71 

gec aac gac ggc act cgc gcg gtg ggc acc cct 
ala asn asp gly thr arg ala val gly thr pro 
271/91 

gta aac aca ctg cgc aac age ggt ttt gat gtg 
val asn thr leu arg asn ser gly phe asp val 
331/111 

ttc aag gee gaa aaa ggg gtg gtg acc etc ggc 
phe lys ala glu lys gly val val thr leu gly 
391/131 

gag tac age etc ggc aca ccg ccg gac ggg gtg 
glu tyr ser leu gly thr pro pro asp gly *val 
451/151 

gac gac agt ccg ggc tgc agt ccg teg gac tac 
asp asp ser pro gly cys ser ' pro ser asp tyr 
511/171 

gtg ctg gta gat cgc ggc gtc tgt cct ttt gec 
val leu val asp arg gly val cys pro phe ala 
571/191 

ggt gcg gtg gcg ctg ate att get gac aac ate 
gly ala val ala leu ile ile ala asp asn ile 
631/211 

ggg get aat acc gac gtc aag ate ccg gtg gtg 
gly ala asn thr asp val lys ile pro val val 
691/231 

eta cgc gga cag tct ggg cca acc acc gtc aag 
leu arg gly gin ser gly pro thr thr val lys 
751/251 

gee cgc aac gtc ate gcg cag acg aag acg ggg 
ala arg asn val ile ala gin thr lys thr gly 
811/271 

gcg cat ttg gac age gtt ccg gaa gga ccc ggc 
ala his leu asp ser val pro glu gly pro gly 
871/291 

gcg gtt ctg gaa acg gca gtg cag ctg ggg aac 
ala val leu glu thr ala val gin leu gly asn 
931/311 

ttc gee ttc tgg ggc gec gag gaa ttc ggc ctg 
phe ala phe trp gly ala glu glu phe gly leu 
991/331 

ctg gac ate gac gcg etc aaa ggc ate gcg ctg 
leu asp ile asp ala leu lys gly ile ala leu 



SEQ ID Irf^s. V^-^Sb 



FIGURE 19D 



09/485536 



71/185 



1021/341 

tat ctg aac ttc gac 
tyr leu asn phe asp 
1081/361 

cag teg ctg ccg eta 
gin ser leu pro leu 
1141/381 

gag cgc acg ttc gtc 
glu arg thr phe val 
1201/401 

gac ggt egg tec gac 
asp gly arg ser asp 
1261/421 

tec ggc get gag gtc 
ser gly ala glu val 
1321/441 

gag cct ttc gat ccc 
glu pro phe asp pro 
1381/461 

gcg etc ggt ate aac 
ala leu gly ile asn 
1441/481 

ggc ggc ccc aac ggg 
gly gly pro asn gly 
1501/501 
■ tga 
OPA 



atg 
met 


ttg 
leu 


gcg 
ala 


teg 
ser 


ccg 
pro 


gac 
asp 


gcc 
ala 


cgc 
arg 


ggt 

gly 


cag 
gin 


gcc 
ala 


tat 
tyr 


ctg 
leu 


aag 
lys 


atg 
met 


tac 
tyr 


gac 
asp 


ggc 

gly 


ttc 
phe 


acg 
thr 


aag 
lys 


aag 
lys 


tec 
ser 


gcc 
ala 


gag 
glu 


aac 
asn 


tat 
tyr 


cac 
his 


cag 
gin 


aag 
lys 


ggc 
gly 


get 
ala 


ggc 
gly 


gtc 
val 


gcg 
ala 


gtt 
val 


ccg 
pro 


gtc 
val 


atg 
met 


gcg 
ala 



1051/351 

aac ccg ggt tac ttc 
asn pro gly tyr phe 
1111/371 

ccg gtg gtg ccc gaa 
pro val val pro glu 
1171/391 

gcc ggc aag acc, gcg 
ala gly lys thr ala 
1231/411 

ctg gcg ggt ate cct 
leu ala gly ile pro 
1291/431 

caa gcc gag etc tgg 
gin ala glu leu trp 
1351/451 

aca gac acc ctg gac 
thr asp thr leu asp 
1411/471 

tac gcg gtg ggt ttg 
tyr ala val gly leu 
1471/491 

gac cgc acc cgc cac 
asp arg thr arg his 



acc tac gac ggt gac 
thr tyr asp gly asp 

ggc teg gcc ggt ate 
gly ser ala gly ile 

cag gac acc teg ttc 
gin asp thr ser phe 

teg ggt ggc ctg ttc 
ser gly gly leu phe 

ggc ggc acc gcc gac 
gly gly thr ala asp 

cat ate gac cgc acc 
his ile asp arg thr 

tat gcg cag gac etc 
tyr ala gin asp leu 

ctg att gcc aaa ccg^ 
leu ile ala lys pro 



SEQ ID i\az>M23 -<£3Z>( continued) 
FIGURE 19D (continued) 



72/185 



ORF according to Cole et al . (Nature 
1/1 

i&> rt> ^31-tag gcc att caa cgc tct gtt cgt ttg 
AMRj(ala ile gin arg ser val arg leu 
tbrto. ^2 61/21 

cag gcg egg tct aat ctg ggc gcg atg 
gin ala arg ser asn leu gly ala met 
121/41 

gcc gtg get gtg gtc gtc gca ttc ctg 
ala val ala val val val ala phe leu 
181/61 

egg ccc gtt gtt aac ggc ccc get gcc 
arg pro val val asn gly pro ala ala 
241/81 

age acc gac gcg atg atg gcg cac eta 
ser thr asp ala met met ala his leu 
301/101 

ggc act cgc gcg gtg ggc acc cct ggc 
gly thr arg ala val gly thr pro gly 
361/121 

ctg cgc aac age ggt ttt gat gtg caa 
leu arg asn ser gly phe asp val gin 
421/141 

gaa aaa ggg gtg gtg acc etc ggc ggc 
glu lys gly val val thr leu gly gly 
481/161 

etc ggc aca ccg ccg gac ggg gtg acg 
leu gly thr pro pro asp gly val thr 
541/181 

ccg ggc tgc agt ccg teg gac tac gac 
pro gly cys ser pro ser asp tyr asp 
601/201 

gat cgc ggc gtc tgt cct ttt gcc cag 
asp arg gly val cys pro phe ala gin 
661/221 

gcg ctg ate att get gac aac ate gac 
ala leu ile ile ala asp asn ile asp 
721/241 

acc gac gtc aag ate ccg gtg gtg agt 
thr asp val lys ile pro val val ser 
781/261 

cag tct ggg cca acc acc gtc aag etc 
gin ser gly pro thr thr val lys leu 
841/281 

gtc ate gcg cag acg aag acg ggg teg 
val ile ala gin thr lys thr gly ser 
901/301 

gac age gtt ccg gaa gga ccc ggc ate 
asp ser val pro glu gly pro gly ile 
961/321 

gaa acg gca gtg cag ctg ggg aac tea 
glu thr ala val gin leu gly asn ser 



393:537-544) and containing Rv0418 
31/11 

att ggt egg tgg gat gcg aaa get gcg egg cga 
ile gly arg trp asp ala lys ala ala arg arg 
91/31 

gtg aac aaa tec agg atg atg ccg gcg gtg ctg 
val asn lys ser arg met met pro ala val leu 
151/51 

acg acg ggc tgt ate egg tgg tct acg cag teg 
thr thr gly cys ile arg trp ser thr gin ser 
211/71 

gca gag ttc gcc gtt gcg ttg cgc aac egg gtg 
ala glu phe ala val ala leu arg asn arg val 
271/91 

teg aaa ctg cag gac ate gcc aac gcc aac gac 
ser lys leu gin asp ile ala asn ala asn asp 
331/111 

tat cag gcc age gtc gac tat gtg gta aac aca 
tyr gin ala ser val asp tyr val val asn thr 
391/131 

acc ccg gag ttc tec get cgc. gtg ttc aag gcc 
thr pro glu phe ser ala arg val phe lys ala 
451/151 

aac acc gtg gag gcg agg gcg etc gag' tac 'age 
asn thr val glu ala arg ala leu glu tyr ser 
511/171 

ggc ccg ctg gtg get gcc ccc gcc gac gac agt 
gly pro leu val ala ala pro ala asp asp ser 
571/191 

agg ctg ccg gtg tec ggt gcg gtg gtg ctg gta 
arg leu pro val ser gly ala val val leu val 
631/211 

aag gaa gac gca gcc gcg cag cgc ggt gcg gtg 
lys glu asp ala ala ala gin arg gly ala val 
691/231 

gag cag gcg atg ggc ggc acc ctg ggg get aat 
glu gin ala met gly gly thr leu gly ala asn 
751/251 

gtc acc aag teg gtc gga ttc cag eta cgc gga 
val thr lys ser val gly phe gin leu arg gly 
811/271 

acg gcg age acc caa agt ttc aag gcc cgc aac 
thr ala ser thr gin ser phe lys ala arg asn 
871/291 

teg gcc aac gtg gtg atg gca ggt gcg cat ttg 
ser ala asn val val met ala gly ala his leu 
931/311 

aac gac aac ggc teg gga gtg get gcg gtt ctg 
asn asp asn gly ser gly val ala ala val leu 
991/331 

ccg cat gtg tec aac gcg gta egg ttc gcc ttc 
pro his val ser asn ala val arg phe ala phe 



SEQ ID >U>». M3I 



FIGURE 19E 



73/185 



1021/341 1051/351 

tgg ggc gcc gag gaa ttc ggc ctg att ggg tea cga aac tac gtc gag teg ctg gac ate 
trp gly ala glu glu phe gly leu ile gly ser arg asn tyr val glu ser leu asp ile 
1081/361 1111/371 

gac gcg etc aaa ggc ate gcg ctg tat ctg aac ttc gac atg ttg gcg teg ccg aac ccg 
asp ala leu lys gly ile ala leu tyr leu asn phe asp met leu ala ser pro asn pro 
1141/381 1171/391 

ggt tac ttc ace tac gac ggt gac cag teg ctg ccg eta gac gcc cgc ggt cag ccg gtg 
gly tyr phe thr tyr asp gly asp gin ser leu pro leu asp ala arg gly gin pro val 
1201/401 1231/411 

gtg ccc gaa ggc teg gcc ggt ate gag cgc acg ttc gtc gcc tat ctg aag atg gcc ggc 
val pro glu gly ser ala gly ile glu arg thr phe val ala tyr leu lys met ala gly 
1261/421 1291/431 

aag ace gcg cag gac ace teg ttc gac ggt egg tec gac tac gac ggc ttc acg ctg gcg 
lys thr ala gin asp thr ser phe asp gly arg ser asp tyr asp gly phe thr leu ala 
1321/441 1351/451 

ggt ate cct teg ggt ggc ctg ttc tec ggc get gag gtc aag aag tec gcc gag caa gcc 
gly ile pro ser gly gly leu phe ser gly ala glu val lys lys ser ala glu gin ala 
1381/461 1411/471 

gag etc tgg ggc ggc ace gcc gac gag cct ttc gat ccc aac tat cac cag aag aca gac 
glu leu trp gly gly thr ala asp glu pro phe asp pro asn tyr his gin lys thr asp 
1441/481 1471/491 

ace ctg gac cat ate gac cgc ace gcg etc ggt ate aac ggc get ggc gtc gcg tac gcg 
thr leu asp his ile asp arg thr ala leu gly ile asn gly ala gly val ala tyr ala 
1501/501 1531/511 

gtg ggt ttg tat gcg cag gac etc ggc ggc ccc aac ggg gtt ccg gtc atg gcg gac cgc 
val gly leu tyr ala gin asp leu gly gly pro asn gly val pro val met ala asp arg 
1561/521 " * ' ' ' ........ .... 

ace cgc cac ctg att gcc aaa ccg tga 
thr arg his leu ile ala lys pro) OPA 

SEQ ID Ho*. ^si - usz (continued) 
FIGURE 19E (continued) 



31/11 

CCS VS3-CGA GAC AGT GGT GCG GGA CAC TTG AGT TCG GCT GCT AAC GAC GCC AGA GTC GCC CGC TTC 

4rg asp ser gly ala gly his leu ser ser ala ala asn asp ala arg val ala arg phe 

£fi> 10*0.43^61/21 91/31 

CGC GGT GTG GGA CTC ACG TTC GGT GAG GGT ACA GCG GAC CTT CGA GGA CGC AAT ATC GTG 
arg gly val gly leu thr phe gly glu gly thr ala asp leu arg ala arg asn ile val 
121/41 151/51 

GGC CGG CTG GGA ACC GTC GGT TTC GAC GTT GGT GAC GAC CCC TCG TTC ATG AAT CGT TCT 
gly arg leu ala thr val gly phe asp val gly asp asp pro ser phe met asn arg ser) 
181/61 211/71 

TGA GCT CCC CGT TTT GCT GGA TGG CCA GGC ACC GCC GGT ACT GCT GCG CTT AAG CTT GTC 
OPAjala pro arg phe ala gly cys pro gly thr ala gly thr ala ala leu lys leu val 
£G>.D*o. §41/81 ' 271/91 

GCA CAT GGT GCC GGC AGG GAG GAA CAG TGG GCA AGC AGC TAG CCG CGC TCG CCG CGC TGG 
ala his gly ala gly arg glu glu gin trp ala ser ser)AMB(pro arg ser pro arg trp 
301/101 331/111 <5>e<S to to- <-i30? 

TCG GTG CGT GCA TGC TCG CAG CCG GAT GCA CCA ACG TGG TCG ACG GGA CCG CCG TGG CTG 
ser val arg ala cys ser gin pro asp ala pro thr trp ser thr gly pro pro trp leu 
361/121 

CCG ACA AAT CCG GAC CAC TGC ATC AGG ATC 
pro thr asn pro asp his cys ile arg ile] 

SEQ ID <4&&- </3<b 



FIGURE 20A 



74/185 



32/11 

tt> MO. GAG ACA GTG GTG CGG GAC ACT TGA GTT CGG CTG CTA ACG ACG CCA GAG TCG CCC GCT TCC 

fglu thr val val arg asp thr\OPA(yal arg leu leu thr thr pro glu ser pro ala ser 
\ 2 / 21 3Cfi>iO 92 ' 31 

GCG GTG TGG GAC TCA CGT TCG GTG AGG GTA CAG CGG ACC TTC GAG CAC GCA ATA TCG TGG 
ala val trp asp ser arg ser val arg val gin arg thr phe glu his ala ile ser trp 
122/41 152/51 

GCC GGC TGG CAA CCG TCG GTT TCG ACG TTG GTG ACG ACC CCT CGT TCA TGA ATC GTT CTT 
ala gly trp gin pro ser val ser thr leu val thr thr pro arg ser) OPA file val leu 
182/61 212/71 S6&> tt> Ko. MH£> 

GAG CTC CCC GTT TTG CTG GAT GCC CAG GCA CCG CCG GTA CTG CTG CGC TTA AGC TTG TCG 
glu leu pro val leu leu asp ala gin ala pro pro val leu leu arg leu ser leu ser 
242/81 272/91 

CAC ATG GTG CCG GCA GGG AGG AAC AGT GGG CAA GCA GCT AGC CGC GCT CGC CGC GCT GGT 
his met val pro ala gly arg asn ser gly gin ala ala ser arg ala arg arg ala gly 
302/101 332/111 

CGG TGC GTG CAT GCT CGC AGC CGG ATG CAC CAA CGT GGT CGA CGG GAC CGC CGT GGC TGC 
arg cys val his ala arg ser arg met his gin arg gly arg arg asp arg arg gly cys 
362/121 

CGA CAA ATC CGG ACC ACT GCA TCA GGA TC 
arg gin ile arg thr thr ala ser gly) 



SEQ ID tfos. 437- V</D 
FIGURE 20B 



33/11 

C<$> iO /o. UHl AGA CAG TGG TGC GGG ACA CTT GAG TTC GGC TGC TAA CGA CGC CAG AGT CGC CCG CTT CCG 
(arg gin trp cys gly thr leu glu phe gly cys) OCH (arg arg gin ser arg pro leu pro 

QS> lOiteMUZ 6 3/21 93/31^^,0*0. </<<5 

CGG TGT GGG ACT CAC GTT CGG TGA GGG TAC AGC GGA CCT TCG AGC ACG CAA TAT CGT GGG 
arg cys gly thr his val arg)oPA(gly tyr ser gly pro ser ser thr gin tyr arg gly 
123/41 5cS) £>*O.V*£> 153/51 

CCG GCT GGC AAC CGT CGG TTT CGA CGT TGG TGA CGA CCC CTC GTT CAT GAA TCG TTC TTG 
pro ala gly asn arg arg phe arg arg trp^OPA(arg pro leu val his glu ser phe leu 
183/61 213/71 ^C<$> HO- 

AGC TCC CCG TTT TGC TGG ATG CCC AGG CAC CGC CGG TAC TGC TGC GCT TAA GCT TGT CGC 
ser ser pro phe cys trp met pro arg his arg arg tyr cys cys ala) OCH £la cys arg 
243/81 273/91 5££> U^Vto.^fc? 

ACA TGG TGC CGG CAG GGA GGA ACA GTG GGC AAG CAG CTA GCC GCG CTC GCC GCG CTG GTC 
thr trp cys arg gin gly gly thr val gly lys gin leu ala ala leu ala ala leu val 
303/101 333/111 

GGT GCG TGC ATG CTC GCA GCC GGA TGC ACC AAC GTG GTC GAC GGG ACC GCC GTG GCT GCC 
gly ala cys met leu ala ala gly cys thr asn val val asp gly thr ala val ala ala 
363/121 

GAC AAA TCC GGA CCA CTG CAT CAG GAT C 
asp lys ser gly pro leu his gin asp^ 



SEQ ID KDs>.qt/*-W63 
FIGURE 20C 



75/185 



part of the nucleotide sequence of seq20A 

1/1 31/11 
EX£> iO to. CfV/TGT GGG ACT CAC GTT CGG TGA GGG TAC AGC GGA CCT TCG AGC ACG CAA TAT CGT GGG CCG 
^ fcvs qly thr his val arg^OPAfgly tyr ser gly pro ser ser: thr gin tyr arg gly pro 

»8^wl 6 { /21 J WifcH*.** 91/31 

GCT GGC AAC CGT CGG TTT CGA CGT TGG TGA CGA CCC CTC GTT CAT GAA TCG TTC TTG AGC 
ala gly asn arg arg phe arg arg trp^OPA (arg pro leu val his glu ser phe leu ser 
121/41 :5£& iDKo. V50151/51 

TCC CCG TTT TGC TGG ATG CCC AGG CAC CGC CGG TAC TGC TGC GCT TAA GCT TGT CGC ACA 
ser pro phe cys trp met pro arg his arg arg tyr cys cys; ala)oCH (ala cys arg thr 
181/61 211/71 tse^/DKo. ^ 

TGG TGC CGG CAG GGA GGA ACA GTG GGC AAG CAG CTA GCC GCG CTC GCC GCG CTG GTC GGT 
trp cys arg gin gly gly thr val gly lys gin leu ala ala leu ala ala leu val gly 
241/81 271/91 

GCG TGC ATG CTC GCA GCC GGA TGC ACC AAC GTG GTC GAC GGG ACC GCC GTG GCT GCC GAC 
ala cys met leu ala ala gly cys thr asn val val asp gly thr ala val ala ala asp 
301/101 

AAA TCC GGA CCA CTG CAT CAG GAT C 
lys ser gly pro leu his gin aspj 

SEQ ID Kctt.^*7-^&1 
FIGURE 20A f 



VDUo- 45Z-GTG GGA CTC ACG TTC GGT GAG GGT ACA GCG 
<S> i£> *to462(val gly leu thr phe gly glu gly thr ala 
61/21 

CTG GCA ACC GTC GGT TTC GAC GTT GGT GAC 
leu ala thr val gly phe asp val gly asp 
121/41 

CCC CGT TTT GCT GGA TGC CCA GGC ACC GCC 
pro arg phe ala gly cys pro gly thr ala 
181/61 

GGT GCC GGC AGG GAG GAA CAG TGG GCA AGC 
gly ala gly arg glu glu gin trp ala ser 
241/81 

CGT GCA TGC TCG CAG CCG GAT GCA CCA ACG 
arg ala cys ser gin pro asp ala pro thr 
301/101 

AAT CCG GAC CAC TGC ATC AGG ATC 
asn pro asp his cys ile arg ile} 



31/11 

GAC CTT CGA GCA CGC AAT ATC GTG GGC CGG 
asp leu arg ala arg asn ile val gly arg 
91/31 

GAC CCC TCG TTC ATG AAT CGT TCT TGA GCT 
asp pro ser prie met asn arg serj OPAfala 
151/51 ^>e<$> li^W^.^V 

GGT ACT GCT GCG CTT AAG CTT GTC GCA CAT 
gly thr ala ala leu lys leu val ala his 
211/71 

AGC TAG CCG CGC TCG CCG CGC TGG TCG GTG 
ser)AMB(pro arg ser pro arg trp ser val 
271/91 ->£<$> iO K^>. <*SS 

TGG TCG ACG GGA CCG CCG TGG CTG CCG ACA 
trp ser thr gly pro pro trp leu pro thr 



SEQ ID Kos. <*52.~ 
FIGURE 20B f 



1/1 
















£<$ \rD Wo. i>t&tH~TC TGG 


GAC 


TCA 


CGT 


TCG 


GTG 


AGG 


GTA 




asp 


se r 


ara 


ser 


val 


ara 


val 














CAA 


CCG 


TCG 


GTT 


TCG 


ACG 


TTG 


gly trp 


gin 


pro 


se r 


val 


ser 


thr 


leu 


121/41 
















CTC CCC 


GTT 


TTG 


CTG 


GAT 


GCC 


CAG 


GCA 


leu pro 


val 


leu 


leu 


asp 


ala 


gin 


ala 


181/61 
















ATG GTG 


CCG 


GCA 


GGG 


AGG 


AAC 


AGT 


GGG 


met val 


pro 


ala 


gly 


arg 


asn 


ser 


gly 


241/81 
















TGC GTG 


CAT 


GCT 


CGC 


AGC 


CGG 


ATG 


CAC 


cys val 


his 


ala 


arg 


ser 


arg 


met 


his 


301/101 
















CAA ATC 


CGG 


ACC 


ACT 


GCA 


TCA 


GGA 


TC 


gin ile 


arg 


thr 


thr 


ala 


ser 


giy' 


> 



76/185 





31/11 


















CAG 


CGG ACC 


TTC 


GAG 


CAC 


GCA 


ATA 


TCG 


TGG 


GCC 


gin 


arg thr 


phe 


glu 


his 


ala 


ile 


ser 


trp 


ala 




91/31 


















GTG 


ACG ACC 


CCT 


CGT 


TCA 


TGA 


ATC 


GTT 


CTT 


GAG 


val 


thr thr 


pro 


arg 


ser ] 


I OP A 


(ile 


val 


leu 


glu 




151/51 














CCG 


CCG GTA 


CTG 


CTG 


CGC 


TTA 


AGC 


TTG 


TCG 


CAC 


pro 


pro val 


leu 


leu 


arg 


leu 


ser 


leu 


ser 


his 




211/71 


















CAA 


GCA GCT 


AGC 


CGC 


GCT 


CGC 


CGC 


GCT 


GGT 


CGG 


gin 


ala ala 


ser 


arg 


ala 


arg 


arg 


ala 


gly 


arg 




271/91 


















CAA 


CGT GGT 


CGA 


CGG 


GAC 


CGC 


CGT 


GGC 


TGC 


CGA 


gin 


arg gly 


arg 


arg 


asp 


arg 


arg 


gly cys 


arg 



SEQ ID K^b. 45<o - L d=>* 
FIGURE 20C f 



sequence Rv357 6 predicted by Cole et al . (Nature 393:537-544) and containing seq20A* 

1/1 31/11 
>eS> iD^-^atg ggc aag cag eta gec gcg etc gec gcg ctg gtc ggt gcg tgc atg etc gca gec gga 

' - • * ^(met gly lys gin leu ala ala leu ala ala leu val gly ala cys met leu ala , ala gly , 

» S>**** p0l 61/ 2i 91/31 

tgc acc aac gtg gtc gac ggg acc gec gtg get gee gac aaa tec gga cca ctg cat cag 

cys thr asn val val asp gly thr ala val ala ala asp lys ser gly pro leu his gin 

121/41 151/51 

gat ccg ata ccg gtt tea gcg ctt gaa ggg ctg ctt etc gac ttg age cag ate aat gee 
asp pro ile pro val ser ala leu glu gly leu leu leu asp leu ser gin ile asn ala 
181/61 211/71 

gcg ctg ggt gcg aca teg atg aag gtg tgg ttc aac gee aag gca atg tgg gac tgg age 
ala leu gly ala thr ser met lys val trp phe asn ala lys ala met trp asp trp ser 
241/81 271/91 

aag age gtg gec gac aag aat tgc ctg get ate gac ggt cca gca cag gaa aag gtc tat 
lys ser val ala asp lys asn cys leu ala ile asp gly pro ala gin glu lys val tyr 
301/101 331/111 

gec ggc acc ggg tgg acc get atg cgc ggc caa egg ctg gat gac age ate gat gac tec 
ala gly thr gly trp thr ala met arg gly gin arg leu asp asp ser ile asp asp ser 
361/121 391/131 

aag aaa cgc gac cac tac gec att caa gcg gtc gtc ggc ttc ccg acc gca cat gat gee 
lys lys arg asp his tyr ala ile gin ala val val gly phe pro thr ala his asp ala 
421/141 451/151 

gag gag ttc tac age tec teg gtg caa age tgg age age tgc teg aac cgc egg ttt gtc 
glu glu phe tyr ser ser ser val gin ser trp ser ser cys ser asn arg arg phe val 
481/161 511/171 

gaa gtc acc ccc gga cag gac gac gec gee tgg act gtg get gac gtt gtc aac gac aac 
glu val thr pro gly gin asp asp ala ala trp thr val ala asp val val asn asp asn 
541/181 571/191 

ggc atg etc agt age teg cag gtt- cag gaa ggc ggc gac gga tgg acc tgc cag cgt gee 
gly met leu ser ser ser gin val gin glu gly gly asp gly trp thr cys gin arg ala 

601/201 631/211 

ctg act gcg cgc aac aac gtc act ate gac att gtc acg tgc gec tat age caa ccg gat 
leu thr ala arg asn asn val thr ile asp ile val thr cys ala tyr ser gin pro asp 
661/221 691/231 

ttg gtg gcg att ggc ate get aac caa ate gcg gec aag gtt get aag cag tag 
leu val ala ile gly ile ala asn gin ile ala ala lys val ala lys glnjAMB 

SEQ ID Kos. </69-</<pO 



FIGURE 20D 



77/185 



ORF according to Cole et al. (Nature 393:537-544) and containing Rv3576 

1/1 31/11 
e<s> O *o. M(*>( taa act tgt cgc aca tgg tgc egg cag gga gga aca gtg ggc aag cag eta gec gcg etc 

OCHjala cys arg thr trp cys arg gin gly gly thr val gly lys gin leu ala ala leu 
<S> <~t(p£ eiTll 91/31 

gec gcg ctg gtc ggt gcg tgc atg etc gca gee gga tgc acc aac gtg gtc gac ggg acc 

ala ala leu val gly ala cys met leu ala ala gly cys thr asn val val asp gly thr 
. 121/41 151/51 

gee gtg get gec gac aaa tec gga cca ctg cat cag gat ccg ata ccg gtt t'ca gcg ctt 

ala val ala ala asp lys ser gly pro leu his gin asp pro ile pro val ser ala leu 

181/61 211/71 

gaa ggg ctg ctt etc gac ttg age cag ate aat gec gcg ctg ggt gcg aca teg atg aag 
glu gly leu leu leu asp leu ser gin ile asn ala ala leu gly ala thr ser met lys 
241/81 271/91 

gtg tgg ttc aac gee aag gca atg tgg gac tgg age aag age gtg gee gac aag aat tgc 
val trp phe asn ala lys ala met trp asp trp ser lys ser val ala asp lys asn cys 
301/101 331/111 

ctg get ate gac ggt cca gca cag gaa aag gtc tat gee ggc acc ggg tgg acc get atg 
leu ala ile asp gly pro ala gin glu lys val tyr ala gly thr gly trp thr ala met 
361/121 391/131 

cgc ggc caa egg ctg gat gac age ate gat gac tec aag aaa cgc gac cac tac gec att 
arg gly gin arg leu asp asp ser ile asp asp ser lys lys arg asp his tyr ala ile 
421/141 451/151 
" ' caa' gcg "gtc gtc ggc ttc ccg acc gca cat gat gee gag gag ttc tac age tec teg gtg 

gin ala val val gly phe pro thr ala his asp ala glu glu phe tyr ser ser ser val 
481/161 511/171 

caa age tgg age age tgc teg aac cgc egg ttt gtc gaa gtc acc ccc gga cag gac gac 
gin ser trp ser ser cys ser asn arg arg phe val glu val thr pro gly gin asp asp 
541/181 571/191 

gee gee tgg act gtg get gac gtt gtc aac gac aac ggc atg etc agt age teg cag gtt 
ala ala trp thr val ala asp val val asn asp asn gly met leu ser ser ser gin val 
601/201 631/211 

cag gaa ggc ggc gac gga tgg acc tgc cag cgt gec ctg act gcg cgc aac aac gtc act 
gin glu gly gly asp gly trp thr cys gin arg ala leu thr ala arg asn asn val thr 
661/221 691/231 

ate gac att gtc acg tgc gee tat age caa ccg gat ttg gtg gcg att ggc ate get aac 
ile asp ile val thr cys ala tyr ser gin pro asp leu val ala ile gly ile ala asn 
721/241 

caa ate gcg gee aag gtt get aag cag tag 
gin ile ala ala lys val ala lys gin) AMB 



SEQ ID Res- -HLoZ 
FIGURE 20F 



78/185 



1/1 31/11 
I'D Afc. H(c3 GTC CTG GTC GCC GCG CAA CTG GCC GGT CCC GAT GGA AAG TGT. TCA CGA TCG CGC TTC TGC 

*A> it> Kr> U&tfo^ ^ eu val ala ala gln leu a ^ a P ro as P ^ vs cvs ser ar 9 ser ar 9 P ne 

61/21 91/31 

CGC TGG TAG TGG CGA TGG TGT TAG CAG GAT TGC GGG TCG AGG CTG CGA TGG CCA GCA CCA 

arg trp] AMB (trp arg trp cys^AMB/gln asp cys gly ser arg leu arg trp pro ala pro 

121/41 S£<5> &\A&' C ((0t> S-£s>iD t*o.V<^Sl/51 

GCG GCC TGC GGC TGG TCG CCG CGC GCG CCG AAA TGA TAC CCG CGA TCA CGA AAT ACA TGT 

ala ala cys gly trp ser pro arg ala pro lysj OPA(tyr pro arg ser arg asn thr cys 

181/61 211/71 srs> i£> (to. 4G>7 

CGG CGC TGG ACG TCG CCG TGC TGG CCA GCT CGA CCG GAC ACG ATG TGG AGG GGG CGC AGA 

arg arg trp thr ser pro cys trp pro ala arg pro asp thr met trp arg gly arg arg 

241/81 271/91 

AAA ACT TCA CCG CCC GCA AGT ACG AGC TGC AGA CGC GAC TGG . CCG ACA CCG ACG TCA TCG 

lys thr ser pro pro ala ser thr ser cys arg arg asp trp pro thr pro thr ser ser 

301/101 331/111 

CAG ACG TGC GGT CGG GAG TGA ACA CGC TGC TCA ACG GCG GTC AGG CGC TGC TGG ATA AGA 

gin thr cys gly arg glu) OPA^thr arg cys ser thr ala val arg arg cys trp ile arg 

361/121 ^>e& ifc y*~*>. 

TGC TGG CCG ACA GCA TCG GCT TGC GGG ATC 

cys trp pro thr ala ser ala cys gly ile) 

SEQ ID Kos. HUS'H(x>^ 
FIGURE 21A 





















32/11 


















TCG 


CCG 


CGC 


AAC 


TGG 


CCG 


GTC 


CCG 


ATG GAA 


AGT 


GTT 


CAC 


GAT 


CGC 


GCT 


TCT 


GCC 


l& i£> to,</7o( ser tr P 


ser 


pro 


arg 


asn 


trp 


pro 


val 


pro 


met glu 


ser 


val 


his 


asp 


arg 


ala 


ser 


ala 


62/21 


















92/31 


















GCT GGT 


AGT 


GGC 


GAT 


GGT 


GTT 


AGC 


AGG 


ATT 


GCG GGT 


CGA 


GGC 


TGC 


GAT 


GGC 


CAG 


CAC 


CAG 


ala gly 


ser 


gly 


asp 


gly 


val 


ser 


arg 


ile 


ala gly 


arg 


gly 


cys 


asp 


gly 


gin 


his 


gin 


122/41 


















152/51 


















CGG CCT 


GCG 


GCT 


GGT 


CGC 


CGC 


GCG 


CGC 


CGA 


AAT GAT 


ACC 


CGC 


GAT 


CAC 


GAA 


ATA 


CAT 


GTC 


arg pro 


ala 


ala 


gly 


arg 


arg 


ala 


arg 


arg 


asn asp 


thr 


arg 


asp 


his 


glu 


ile 


his 


val 


182/61 


















212/71 


















GGC GCT 


GGA 


CGT 


CGC 


CGT 


GCT 


GGC 


CAG 


CTC 


GAC CGG 


ACA 


CGA 


TGT 


GGA 


GGG 


GGC 


GCA 


GAA 


gly ala 


gly 


arg 


arg 


arg 


ala 


gly 


gin 


leu 


asp arg 


thr 


arg 


cys 


gly 


gly 


gly 


ala 


glu 


242/81 


















272/91 


















AAA CTT 


CAC 


CGC 


CCG 


CAA 


GTA 


CGA 


GCT 


GCA 


GAC GCG 


ACT 


GGC 


CGA 


CAC 


CGA 


CGT 


CAT 


CGC 


lys leu 


his 


arg 


pro 


gin 


val 


arg 


ala 


ala 


asp ala 


thr 


gly 


arg 


his 


arg 


arg 


his 


arg 


302/101 


















332/111 


















AGA CGT 


GCG 


GTC 


GGG 


AGT 


GAA 


CAC 


GCT 


GCT 


CAA CGG 


CGG 


TCA 


GGC 


GCT 


GCT 


GGA 


TAA 


GAT 


arg arg 


ala 


val 


gly 


ser 


glu 


his 


ala 


ala 


gin arg 


arg 


ser 


gly 


ala 


ala 


gly) OCH, 


(asp 


_3.62/_121 
































secs> id Ac- 


GCT GGC 


CGA 


CAG 


CAT 


CGG 


CTT 


GCG 


GGA 


TC 




















ala gly 


arg 


gin 


his 


arg 


leu 


ala 


giy) 























SEQ ID R£>s. *f<*9 



FIGURE 21B 
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^ 72 CCT GGT 


CGC 


CGC 


GCA 


ACT 


GGC 


CGG 


TCC 


CGA 


TGG AAA 


GTG 


TTC 


ACG 


ATC 


GCG 


CTT 


CTG 


CCG 


wvofpro giy 


arg 


a rg 


ax a 


uxir 


gly 


a rg 


ser 


arg 


trp lys 
y o/ oi 


val 


phe 


thr 


ile 


ala 


leu 


leu 


pro 


CTG GTA 


GTG 


GCG 


ATG 


GTG 


TTA 


GCA 


GGA 


TTG 


CGG GTC 


GAG 


GCT 


GCG 


ATG 


GCC 


AGC 


ACC 


AGC 


leu val 


val 


ala 


met 


val 


leu 


ala 


gly 


leu 


arg val 

1 /CI 

loo/ Ol 


glu 


ala 


ala 


met 


ala 


ser 


thr 


ser 


GGC CTG 


CGG 


CTG 


GTC 


GCC 


GCG 


CGC 


GCC 


GAA. 


ATG ATA 


CCC 


GCG 


ATC 


ACG 


AAA 


TAC 


ATG 


TCG 


gly leu 

XD J/ Ol 


arg 


leu 


val 


ala 


ala 


arg 


ala 


glu 


met ile 

£.±6/ / 1 


pro 


ala 


ile 


thr 


lys 


tyr 


met 


ser 


'GCG CTG 


GAC 


GTC 


GCC 


GTG 


CTG 


GCC 


AGP 


TPG 


APP GGA 


GAP 






r- A f~* 




/"■/"■/-• 


CAG 


AAA 


ala leu 
243/81 


asp 


val 


ala 


val 


leu 


ala 


ser 


ser 


thr gly 
273/91 


his 


asp 


val 


glu 


gly 


ala 


gin 


lys 


AAC TTC 


ACC 


GCC 


CGC 


AAG 


TAC 


GAG 


CTG 


CAG 


ACG CGA 


CTG 


GCC 


GAC 


ACC 


GAC 


GTC 


ATC 


GCA 


asn phe 
303/101 


thr 


ala 


arg 


lys 


tyr 


glu 


leu 


gin 


thr arg 
333/111 


leu 


ala 


asp 


thr 


asp 


val 


ile 


ala 


GAC GTG 


CGG 


TCG 


GGA 


GTG 


AAC 


ACG 


CTG 


CTC 


AAC GGC 


GGT 


CAG 


GCG 


CTG 


CTG 


GAT 


AAG 


ATG 


asp val 
363/121 


arg 


ser 


gly 


val 


asn 


thr 


leu 


leu 


asn gly 


gly 


gin 


ala 


leu 


leu 


asp 


lys 


met 


CTG GCC 


GAC 


AGC 


ATC 


GGC 


TTG 


CGG 


GAT 


C 




















leu ala 


asp 


ser 


ile 


gly 


leu 


arg 


asp) 























SEQ ID v^Mlt-'WS 
FIGURE 21C 



part of the nucleotide sequence of seq21A 





















31/11 


















ACG ATC 


GCG 


CTT 


CTG 


CCG 


CTG 


GTA 


GTG 


GCG 


ATG GTG 


TTA 


GCA 


GGA 


TTG 


CGG 


GTC 


GAG 


GCT 


(thr ile 


ala 


leu 


leu 


pro 


leu 


val 


val 


ala 


met val 


leu 


ala 


gly 


leu 


arg 


val 


glu 


ala 


61/21 


















91/31 


















GCG ATG 


GCC 


AGC 


ACC 


AGC 


GGC 


CTG 


CGG 


CTG 


GTC GCC 


GCG 


CGC 


GCC 


GAA 


ATG 


ATA 


CCC 


GCG 


ala met 


ala 


ser 


thr 


ser 


gly 


leu 


arg 


leu 


val ala 


ala 


arg 


ala 


glu 


met 


ile 


pro 


ala 


121/41 


















151/51 


















ATC ACG 


AAA 


TAC 


ATG 


TCG 


GCG 


CTG 


GAC 


GTC 


GCC GTG 


CTG 


GCC 


AGC 


TCG 


ACC 


GGA 


CAC 


GAT 


ile thr 


lys 


tyr 


met 


ser 


ala 


leu 


asp 


val 


ala val 


leu 


ala 


ser 


ser 


thr 


gly 


his 


asp 


181/61 


















211/71 


















GTG GAG 


GGG 


GCG 


CAG 


AAA 


AAC 


TTC 


ACC 


GCC 


CGC AAG 


TAC 


GAG 


CTG 


CAG 


ACG 


CGA 


CTG 


GCC 


val glu 


gly 


ala 


gin 


lys 


asn 


phe 


thr 


ala 


arg lys 


tyr 


glu 


leu 


gin 


thr 


arg 


leu 


ala 


241/81 


















271/91 


















GAC ACC 


GAC 


GTC 


ATC 


GCA 


GAC 


GTG 


CGG 


TCG 


GGA GTG 


AAC 


ACG 


CTG 


CTC 


AAC 


GGC 


GGT 


CAG 


asp thr 


asp 


val 


ile 


ala 


asp 


val 


arg 


ser 


gly val 


asn 


thr 


leu 


leu 


asn 


gly 


gly 


gin 


301/101 


















331/111 


















GCG CTG 


CTG 


GAT" 


~AAG~ 


"ATG' 


~CTST 




"GAC 


"AGC 


~ATC~GG~C 


TTG 


CGG 


GAT 


C 










ala leu 


leu 


asp 


lys 


met 


leu 


ala 


asp 


ser 


ile gly 


leu 


arg 


asp; 











SEQ ID Hc5. ^7<7-<y 75 
FIGURE 21A 1 



09/435536 



80/185 



C® ID Uffc 1/1 31/11 

^CGA TCG CGC TTC TGC CGC TGG TAG TGG CGA TGG TGT TAG CAG GAT TGC GGG TCG AGG CTG 

e<5> ifc*o.c/ 77 (arg ser arg phe cys arg trp)AMB(trp arg trp cys) AMB (gin asp cys gly ser arg leu 
61/21 S^iOMc.v7?91/31 ' sek^flo.</V9 

CGA TGG CCA GCA CCA GCG GCC TGC GGC TGG TCG CCG CGC GCG CCG AAA TGA TAC CCG CGA 

arg trp pro ala pro ala ala cys gly trp ser pro arg ala pro lysJoPAftyr pro arg 
121/41 151/51 - - V 



TCA CGA AAT ACA TGT CGG CGC TGG ACG TCG CCG TGC TGG CCA GCT CGA CCG 'SlC^ATC ATG 

ser arg asn thr cys arg arg trp thr ser pro cys trp pro ala arg pro asp thr met 
181/61 211/71 

TGG AGG GGG CGC AGA AAA ACT TCA CCG CCC GCA AGT ACG AGC TGC AGA CGC GAC TGG CCG 

trp arg gly arg arg lys thr ser pro pro ala ser thr ser cys arg arg "asn tro nro 
241/81 271/91 P P 

ACA CCG ACG TCA TCG CAG ACG TGC GGT CGG GAG TGA ACA CGC TGC TCA ACG GCG ■ GTC AGG 

thr pro thr ser ser gin thr cys gly arg glu)oPA/thr arg cys ser thr ala val ara 
301/101 331/111^^^.^ 

CGC TGC TGG ATA AGA TGC TGG CCG ACA GCA TCG GCT TGC GGG ATC 

arg cys trp ile arg cys trp pro thr ala ser ala cys gly ile^ 



SEQ ID 

FIGURE 21B' 



1/1 31/11 
JD q&2 CAC GAT CGC GCT TCT GCC GCT GGT AGT GGC GAT GGT GTT AGC AGG ATT GCG GGT CGA GGC 

(his asp arg ala ser ala ala gly ser gly asp gly val ser arg ile ala gly arg gly 
61/21 91/31 

TGC GAT GGC CAG CAC CAG CGG CCT GCG ^ GCT GGT CGC CGC GCG CGC CGA AAT GAT ACC CGC 
cys asp gly gin his gin arg pro ala ala gly arg arg ala arg arg asn asp thr arg 
121/41 151/51 

GAT CAC GAA ATA CAT GTC GGC GCT GGA CGT CGC CGT GCT GGC CAG CTC GAC CGG ACA CGA 
asp his glu ile his val gly ala gly arg arg arg ala gly gin leu asp arg thr arg 
181/61 211/71 

TGT GGA GGG GGC GCA GAA AAA CTT CAC CGC CCG CAA GTA CGA GCT GCA GAC GCG ACT GGC 
cys gly gly gly ala glu lys leu his arg pro gin val arg ala ala asp ala thr gly 
241/81 271/91 

CGA CAC CGA CGT CAT CGC AGA CGT GCG GTC GGG AGT GAA CAC GCT GCT CAA CGG CGG TCA 
arg his arg arg his arg arg arg ala val gly ser glu his ala ala gin arg arg ser 
301/101 331/111 

GGC GCT GCT GGA TAA GAT GCT GGC CGA CAG CAT CGG CTT GCG GGA TC 
gly ala ala gly^OCH^asp ala gly arg gin his arg leu ala gly) 



-SEQ— I-D -ttosrtf? Z^VS M ' 
FIGURE 21C T 



81/185 



sequence Rv3365c predicted by Cole et al . (Nature 393:537-544) and containing 
Seq21A' 

1/1 31/11 
i& iv5 V^o.t-t^bgtg acc atg ttc gcc cgc ccg acc ate ccg gtc gcg gcg gec get tct gat att tec gec 
(val thr met phe ala arg pro thr ile pro val ala ala ala ala ser asp ile ser ala 
61/21 91/31 

ccg get caa ccg gcc cgc ggc aaa cct cag caa cgc ccg ccg tec tgg teg ccg cgc aac 
pro ala gin pro ala arg gly lys pro gin gin arg pro pro ser trp ser pro arg asn 
, 121/41 151/51 
tgg ccg gtc cga tgg aaa gtg ttc acg ate gcg ctt ctg ccg ctg gta gtg gcg atg gtg 
trp pro val arg trp lys val phe thr ile ala leu leu pro leu val val ala met val 
181/61 211/71 

tta gca gga ttg egg gtc gag get gcg atg gcc age acc age ggc ctg egg ctg gtc gcc 
leu ala gly leu arg val glu ala ala met ala ser thr ser gly leu arg leu val ala 
241/81 271/91 

gcg cgc gcc gaa atg ata ccc gcg ate acg aaa tac atg teg gcg ctg gac gtc gcc gtg 
ala arg ala glu met ile pro ala ile thr lys tyr met ser ala leu asp val ala val 
301/101 331/111 

ctg gcc age teg acc gga cac gat gtg gag ggg gcg cag aaa aac ttc acc gcc cgc aag 
leu ala ser ser thr gly his asp val glu gly ala gin lys asn phe thr ala arg lys 
361/121 391/131 

tac gag ctg cag acg cga ctg gcc gac acc gac gtc ate gca gac gtg egg teg gga gtg 
tyr glu leu gin thr arg leu ala asp thr asp val ile ala asp val arg ser gly val 
421/141 451/151 

J > aac acg 1 ctg 'etc aac ggc ggt cag gcg ctg ctg gat aag gtg ctg gcc gac age ate ggc 

asn thr leu leu asn gly gly gin ala leu leu asp lys val leu ala asp ser ile gly 
481/161 511/171 

ttg egg gat egg gtc acc gcc tac gcg ccg ctg ctg ttg acg gcc cag aac gtg att gac 
leu arg asp arg val thr ala tyr ala pro leu leu leu thr ala gin asn val ile asp 
541/181 571/191 

gcg teg gtg egg gtt gac age gag caa ate cga acc cag gtg cag ggt ttg age cga gcc 
ala ser val arg val asp ser glu gin ile arg thr gin val gin gly leu ser arg ala 
601/201 631/211 

gtt ggc gcc cgc ggg cag atg acg atg cag gag ate ctg gtg act cgc ggc gcc gac ctt 
val gly ala arg gly gin met thr met gin glu ile leu val thr arg gly ala asp leu 
661/221 691/231 

gcc gag ccg caa ctg cgc age gcg atg gtt acc ctg gcc ggc acc gaa ccc teg acg ctg 
ala glu pro gin leu arg ser ala met val thr leu ala gly thr glu pro ser thr leu 
721/241 751/251 

ttc ggg atg age gcg gcg etc ggt gca ggc teg ccg gac acc aag aac ctg cag cag caa 
phe gly met ser ala ala leu gly ala gly ser pro asp thr lys asn leu gin gin gin 
781/261 811/271 

atg gtg acc agg atg gcg ate atg tec gat ccg gcc gtt gca ctg gtc aac aac cca gag 
met val thr arg met ala ile met ser asp pro ala val ala leu val asn asn pro glu 
841/281 871/291 

ctg ctg cac teg ata cag ate acc cgc gac att gcc gag cag gtg ate acc gac acc acc 
leu leu his ser ile gin ile thr. arg asp ile ala glu gin val ile thr asp thr thr 

-961V-3G-1- 93-1-/-31-1- 



gag gcg gtg acg aag teg gtg caa age cag gcc acc gac egg egg gat gcc gcg att cgc 

glu ala val thr lys ser val gin ser gin ala thr asp arg arg asp ala ala ile arg 
961/321 991/331 

gac gcc gtg ctg gtg ttg gcc gcc ate gcg acc gcg ate gtc gtc gtg ttg gtg gtg gcg 

asp ala val leu val leu ala ala ile ala thr ala ile val val val leu val val ala 



SEQ ID :*a=>vS5-V8G> 



FIGURE 21D 



097485556 

82/185 



1021/341 
cgc acg ctg 
arg thr leu 
1081/361 
gat etc gac 
asp leu asp 
1141/381 
gcg gtg tac 
ala val tyr 
1201/401 
egg gee ctg 
arg ala leu 
1261/421 
gag ace atg 
glu thr met 
1321/441 
ctg gag cgc 
leu glu arg 
1381/461 
gec gec egg 
ala ala arg 
1441/481 
cgt gac cac 
arg asp his 
1501/501 
gag gac tat 
glu asp tyr 
1561/521 
get ggt ggc 
ala gly gly 
1621/541 
ccg acc aca 
pro thr thr 
1681/561 
ate teg gat 
ile ser asp 
1741/581 
egg gec ggc 
arg ala gly 
1801/601 
egg ctg gee 
arg leu ala 
1861/621 
ggc acc ggc 
gly thr gly 
1921/641 
gcg cag ccg 
ala gin pro 
1981/661 
ycc gat ccg- 



gtc 
val 

ggc 
gly 

acc 
thr 

ttg 
leu 

teg 
ser 

aac 
asn 

ctg 
leu 

cgc 
arg 

cgc 
arg 

gtc 
val 

ccc 
pro 

tec 
ser 

ggt 
gly 

ggt 
gly 

acc 
thr 

cca 
pro 



ggg ccg 
gly pro 

gag ate 
glu ile 

acc gag 
thr glu 

ctg gee 
leu ala 

egg cgt 
arg arg 

gag gag 
glu glu 

cgc cgc 
arg arg 

gag ccg 
glu pro 

cgc gtc 
arg . val 

att cat 
ile his 

gtt egg 
val arg 

ggc ctg 
gly leu 

gag gtc 
glu val 

egg cac 
arg his 

acc gee 
thr ala 

aag ccg 
lys pro 



atg 
met 

gcg 
ala 

gaa 
glu 

ggc 
gly 

age 
ser 

gat 
asp 

aac 
asn 

gtg 
val 

gac 
asp 

ctg 
leu 

gtt 
val 

ggc 
gly 

acc 
thr 

ggc 
gly 

gag 
glu 

egg 
arg 



egg gta 
arg val 

gcg gtc 
ala val 

ate ggt 
ile gly 

gag gaa 
glu glu 

cgt tec 
arg ser 

ccc gee 
pro ala 

age gec 
ser ala 

ccg ctg 
pro leu 

ate gcg 
ile ala 

ctt gee 
leu ala 

gee gee 
ala ala 

atg acc 
met thr 

ccg gat 
pro asp 

ate cga 
ile arg 

gtc tac 
val tyr 

gta ttt 
val phe 



ctg 
leu 

cgc 
arg 

cag 
gin 

acg 
thr 

ctg 
leu 

cga 
arg 

aac 
asn 

tea 
ser 

agg 
arg 

gag 
glu 

gca 
ala 

gat 
asp 

agt 
ser 

gtc 
val 

ctg 
leu 



gcg 
ala 



1051/351 
cgt gat ggg 
arg asp gly 
1111/371 
gee ggc gac 
ala gly asp 
1171/391 
gtc gcg cat 
val ala his 
1231/411 
egg ttg cga 
arg leu arg 
1291/431 
gtc gac cag 
val asp gin 
1351/451 
etc gac age 
leu asp ser 
1411/471 
ctg ctg gtg 
leu leu val 
1471/491 
acc gtg ate 
thr val ile 
1531/511 
gta ccc gac 
val pro asp 
1591/531 
ctg ate gac 
leu ile asp 
1651/551 
ate ggc age 
ile gly ser 
1711/571 
gec gat egg 
ala asp arg 
1771/591 
gee egg cac 
ala arg his 
1831/611 
ggg ctg cgc 
gly leu arg 
1891/631 
ccg eta gee 
pro leu ala 
1951/651 
ate aag ccg 
ile lys pro 
2011/671 



gcg 
ala 

gag 
glu 

gcg 
ala 

ctg 
leu 



cag 
gin 

ctt 
leu 

ctg 
leu 

age 
ser 

tgt 
cys 

aac 
asn 

gaa 
glu 

egg 
arg 

atg 

met 

ggt 
gly 

gtg 
val 

ccg 
pro 



etc 
leu 

ccg 
pro 

gtc 
val 

ctg 
leu 

ctg 
leu 

ttc 
phe 

gec 
ala 



gec 
ala 



gcg 
ala 

gcg 
ala 

ggc 
gly 

atg 
met 

ggt 
gly 

ccg 
pro 

etc 
leu 

tgt 
cys 



aag 
lys 


gtt 
val 


get 
ala 


cat 

his 


acc 
thr 


ate 
ile 


ccc 
pro 


gag 
glu 


cca 
pro 


ctg 
leu 


gac 
asp 


gag 
glu 


ctg 
leu 


cac 
his 


acc 
thr 


gtc 
val 


aac 
asn 


gag 
glu 


atg 
met 


ttt 
phe 


teg 
ser 


gtc 
val 


ate 
ile 


gac 
asp 


caa 
gin 


egg 
arg 


etc 
leu 


gat 
asp 


cac 
his 


ctg 
leu 


ggt 
gly 


gcg 
ala 


cag 
gin 


att 
ile 


acc 
thr 


gee 
ala 


gtg 
val 


tea 
ser 


gag 
glu 


gtc 
val 


gta 
val 


gtc 
val' 


ggc 
gly 


gca 
ala 


9 C 3 
ala' 


ttg 
leu 


cgc 
arg 


tac 
tyr 


teg 
ser 


tea 
ser 


agt 
ser 


gtt 
val 


ctg 
leu 


ctg 
leu 


cga 
arg 


gee 
ala 


aat 
asn 


atg 
met 


egg 
arg 


ctg 
leu 


ctg 
leu 


ttc 
phe 


gta 
val 


gtc 

val 


ggc 
gly 


gtg 
val 


acc 
thr 


ggt 
gly 


gaa 
glu 


cag 
gin 


gag 
glu 


ggg 

gly 


acg 
thr 


gee 
ala 


cca 
pro 


cct 
pro 


gaa 
glu 


ccc 
pro 


gcg 
ala 


gcg 
ala 



^a-cg— ga-c~ 



ala asp pro thr asp val pro ala ala ile gly pro leu pro pro 



g tc a-eg— trtg— ctrc— cc g- 
val thr leu leu pro 



SEQ ID tx&us&yfb (continued 1) 
FIGURE 21D (continued 1) 



83/185 



2041/681 

cgc cgt acc ccg ggg 
arg arg thr pro gly 
2101/701 

egg cgc gag ctg aaa 
arg arg glu leu lys 
2161/721 

ccc gca cca gaa ccg 
pro ala pro glu pro 
2221/741 

gtt gat gac gac gtc 
val asp asp asp val 
2281/761 

ctg gcc cac age ccc 
leu ala his ser pro 
2341/781 

gcc gag gcc gcg gac 
ala glu ala ala asp 
2401/801 

ccc ggg gcc egg tta 
pro gly ala arg leu 
2461/821 

ggt gca gcg eta gca 
gly ala ala leu ala 
2521/841 

gcg gta cgc gac ccc 
ala val -arg- asp pro 
2581/861 

acc ggg egg teg cat 
thr gly arg ser his 



tec agt ggc ate gcc 
ser ser gly ile ala 

aca ccc tgg tgg gag 
thr pro trp trp glu 

cga ccg gcg ccg ccg 
arg pro ala pro pro 

ate tac egg egg atg 
ile tyr arg arg met 

gat ctg gac tgg aag 
asp leu asp trp lys 

aag ccc gtg cag tec 
lys pro val gin ser 

gtg ccg ggg gcg gcg 
val pro gly ala ala 

tec aac ggc gga ctt 
ser asn gly gly leu 

gac gcg gtt cgt gcc 
asp ala val arg ala 

gcc cgc gag age agt 
ala arg glu ser ser 



2071/691 

gac gtc ccg gcc cag 
asp val pro ala gin 
2131/7.11 

gat agg ttt caa cag 
asp arg phe gin gin 
2191/731 

ccc gcc aaa ccc gcg 
pro ala lys pro ala 
2251/751 

etc tec gag atg gtg 
leu ser glu met val 
2311/771 

teg gtg tgg gac cac 
ser val trp asp his 
2371/791 

cgc acg gac tac ggc 
arg thr asp tyr gly 
2431/811 

gtg cct gag gga ccc 
val pro glu gly pro 
2491/831 

cat ccc ggc cga gcg 
his pro gly arg ala 
2551/851 

tec ate age age cat 
ser ile 1 ser ser his 
2611/871 

cag gga ccc aat cag 
gin gly pro asn gin 



ccg atg cag cag egg 
pro met gin gin arg 

gag ccc aaa caa ccg 
glu pro lys gin pro 

cca ccg gcg ggc ccg 
pro pro ala gly pro 

ggt gac ccg cac gag 
gly asp pro his glu 

ggc tgg teg gcg gcc 
gly trp ser ala ala 

ctg ccg gtg cgc gaa 
leu pro val arg glu 

gat egg gag cat ccg 
asp arg glu his pro 

ccg egg cac gcg get 
pro arg his ala ala 

ttc ggc ggc gtg cgc 
phe gly'gly val arg 

caa tga 
gln^OPA 



SEQ ID K/js. </*5-v*fc< continued) 
FIGURE 21D (continued) 



09/485536 



84/185 



ORF according to Cole et al . (Nature 393:537-544) and containing Rv3365c 
=<» i^t^o.W7taa ggg tgc ggc egg tgg cac ggc cgc ggc cac gtg acc atg ttc gec cgc ccg ace ate 

»^uto.s»-ii^v y cys gly arg trp his gly arg gly 9i/ 3 r x thr met phe aia arg pro thr iie 

ccg gtc gcg gcg gec get tct gat att tec gec ccg get caa ccg gee cgc ggc aaa cct 
pro val ala ala ala ala ser asp ile ser ala pro ala gin pro ala arg gly lys pro 
121/41 151/51 

cag caa cgc ccg ccg tec tgg teg ccg cgc aac tgg ccg gtc cga tgg aaa gtg ttc acg 
gin gin arg pro pro ser trp ser pro arg asn trp pro val arg trp lys val phe thr 
181/61 211/71 

ate gcg ctt ctg ccg ctg gta gtg gcg atg gtg tta gca gga ttg egg gtc gag get gcg 
lie ala leu leu pro leu val val ala met val leu ala gly leu arg val glu ala ala 
241/81 271/91 

atg gee age acc age ggc ctg egg ctg gtc gee gcg cgc gee gaa atg ata ccc gcg ate 
met ala ser thr ser gly leu arg leu val ala ala arg ala glu met ile pro ala ile 
301/101 ^ 331/111 

acg aaa tac atg teg gcg ctg gac gtc gee gtg ctg gee age teg acc gga cac gat gtg 
thr lys tyr met ser ala leu asp val ala val leu ala ser ser thr gly his asp val 
361/121 391/131 

gag ggg gcg cag aaa aac ttc acc gee cgc aag tac gag ctg cag acg cga ctg gec gac 
glu gly ala gin lys asn phe thr ala arg lys tyr glu leu gin thr arg leu ala asp 
421/141 451/151 

acc gac gtc ate gca gac gtg egg teg gga gtg aac acg ctg etc aac ggc ggt cag gcg 
thr asp val ile ala asp val arg ser gly val asn thr leu leu asn ,g,ly gly gin ala 
481/161 f 511/171 

ctg ctg gat aag gtg ctg gee gac age ate ggc ttg egg gat egg gtc acc gec tac gcg 
leu leu asp lys val leu ala asp ser ile gly leu arg asp arg val thr ala tyr ala 
541/181 571/191 

ccg ctg ctg ttg acg gee cag aac gtg att gac gcg teg gtg egg gtt gac age gag caa 
pro leu leu leu thr ala gin asn val ile asp ala ser val arg val asp ser glu gin 
601/201 631/211 

ate cga acc cag gtg cag ggt ttg age cga gee gtt ggc gee cgc ggg cag atg acg atg 
ile arg thr gin val gin gly leu ser arg ala val gly ala arg gly gin met thr met 
661/221 691/231 

cag gag ate ctg gtg act cgc ggc gec gac ctt gee gag ccg caa ctg cgc age gcg atg 
gin glu ile leu val thr arg gly ala asp leu ala glu pro gin leu arg ser ala met 
721/241 751/251 

gtt acc ctg gee ggc acc gaa ccc teg acg ctg ttc ggg atg age gcg gcg etc ggt gca 
val thr leu ala gly thr glu pro ser thr leu phe gly met ser ala ala leu gly ala 
781/261 811/271 

ggc teg ccg gac acc aag aac ctg cag cag caa atg gtg acc agg atg gcg ate atg tec 
gly ser pro asp thr lys asn leu gin gin gin met val thr arg met ala ile met ser 
841/281 871/291 

gat ccg gee gtt gca ctg gtc aac aac cca gag ctg ctg cac teg ata cag ate acc cgc 
asp pro ala val ala leu val asn asn pro glu leu leu his ser ile gin ile thr arg 



SEQ ID ltoe>.4*7-M^8 
FIGURE 2i£ 



09/4 85 536 

85/185 



901/301 

gac att gcc gag cag 
asp ile ala glu gin 
961/321 

cag gcc acc gac egg 
gin ala thr asp arg 
1021/341 

gcg acc gcg ate gtc 
ala thr ala ile val 
1081/361 

ctg cgt gat ggg gcg 
leu arg asp gly ala 
1141/381 

cgc gcc ggc gac gag 
arg ala gly asp glu 
1201/401 

cag gtc gcg cat gcg 
gin val ala his ala 
1261/421 

acg egg ttg cga ,ctg 
thr arg leu arg leu 
1321/441 

ctg gtc gac cag cag 
leu val asp gin gin 
1381/461 

cga etc gac age ctt 
arg leu asp ser leu 
1441/481 

aac ctg ctg gtg ctg 
asn leu leu val leu 
1501/501 

tea acc gtg ate age 
ser thr val ile ser 
1561/521 

agg gta ccc gac tgt 
arg val pro asp cys 
1621/541 

gag ctg ate gac aac 
glu leu ile asp asn 
1681/561 

gca ate ggc age gaa 
ala ile gly ser glu 
1741/581 

gat gcc gat egg egg 
asp ala asp arg arg 
1801/601 

agt gcc egg cac atg 
ser ala arg his met 



gtg ate acc gac acc 
val ile thr asp thr 

egg gat gcc gcg att 
arg asp ala ala ile 

gtc gtg ttg gtg gtg 
val val leu val val 

etc aag gtt get cat 
leu lys val ala his 

ccg ate ccc gag cca 
pro ile pro glu pro 

gtc gac gag ctg cac 
val asp glu leu his 

ctg gtc aac gag atg 
leu val asn glu met 

ctg teg gtc ate gac 
leu ser val ile asp 

ttcegg etc gat cac 
phe arg leu asp his 

gcc ggt gcg cag att 
ala gly ala gin ile 

gcc gcc gtg tea gag 
ala ala val ser glu 

gcg gta gtc ggc gca 
ala val val gly ala 

gcg ttg cgc tac teg 
ala leu arg tyr ser 

ggc agt gtt ctg ctg 
gly ser val leu leu 

atg gcc aat atg egg 
met ala asn met arg 

ggt ctg ttc gta gtc 
gly leu phe val val 



931/311 

acc gag gcg gtg acg 
thr glu ala val thr 
991/331 

cgc gac gcc gtg ctg 
arg asp ala val leu 
1051/351 

gcg cgc acg ctg gtc 
ala arg thr leu val 
1111/371 

acc gat etc gac ggc 
thr asp leu asp gly 
1171/391 

ctg gcg gtg tac acc 
leu ala val tyr thr 
1231/411 

acc egg gcc ctg ttg 
thr arg ala leu leu 
1291/431 

ttt gag acc atg teg 
phe glu thr met ser 
1351/451 

caa ctg gag cgc aac 
gin leu glu arg asn 
1411/471 

ctg gcc, gcc egg ctg 
leu ala ala arg leu 
1471/491 

acc cgt gac cac cgc 
thr arg asp his arg 
1531/511 

gtc gag gac tat cgc 
val glu asp tyr arg 
1591/531 

gcg get ggt ggc gtc 
ala ala gly gly val 
1651/551 

tea ccg acc aca ccc 
ser pro thr thr pro 
1711/571 

cga ate teg gat tec 
arg ile ser asp ser 
1771/591 

ctg egg gcc ggc ggt 
leu arg ala gly gly 
1831/611 

ggc egg ctg gcc ggt 
gly arg leu ala gly 



aag teg gtg caa age 
lys ser val gin ser 

gtg ttg gcc gcc ate 
val leu ala ala ile 

ggg ccg atg egg gta 
gly pro met arg val 

gag ate gcg gcg gtc 
glu ile ala ala val 

acc gag gaa ate ggt 
thr glu glu ile gly 

ctg gcc ggc gag gaa 
leu ala gly glu glu 

egg cgt age cgt tec 
arg arg ser arg ser 

gag gag gat ccc gcc 
glu glu asp pro ala 

cgc cgc,,, aac .age gcc 
arg arg asn ser ala 

gag ccg gtg ccg ctg 
glu pro val pro leu 

cgc gtc gac ate gcg 
arg val asp ile ala 

att cat ctg ctt gcc 
ile his leu leu ala 

gtt egg gtt gcc gcc 
val arg val ala ala 

ggc ctg ggc atg acc 
gly leu gly met thr 

gag gtc acc ccg gat 
glu val thr pro asp 

egg cac ggc ate cga 
arg his gly ile arg 



SEQ ID Kcs>. q$7. vgr (continued 1) 
FIGURE 2 IE (continued 1) 



86/185 



1861/621 

gtc ggg ctg cgc ggt ccg gtg acc ggt gaa 
val gly leu arg gly pro val thr gly glu 
1921/641 

ctg ccg eta gec gtg etc gag ggg acg gec 
leu pro leu ala val leu glu gly thr ala 
1981/661 

gcg ate aag ccg ccg tgt cct gaa ccc gcg 
ala ile lys pro pro cys pro glu pro ala 
2041/681 

ate ggg ccg eta cca ccg gtc acg ttg etc 
ile gly pro leu pro pro val thr leu leu 
2101/701 

gec gac gtc ccg gec cag ccg atg cag cag 
ala asp val pro ala gin pro met gin gin 
2161/721 

gag gat agg ttt caa cag gag ccc aaa caa 
glu asp arg phe gin gin glu pro lys gin 
2221/741 

ccg ccc gee aaa ccc gcg cca ccg gcg ggc 
pro pro ala lys pro ala pro pro ala gly 
2281/761 

atg etc tec gag atg gtg ggt gac ccg cac 
met leu ser glu met val gly asp pro his 
2341/781 

aag teg gtg tgg gac cac ggc tgg teg gcg 
lys ser val trp asp his gly trp ser ala 
2401/801 

tec cgc acg gac tac ggc ctg ccg gtg cgc 
ser arg thr asp tyr gly leu pro val arg 
2461/821 

gcg gtg cct gag gga ccc gat egg gag cat 
ala val pro glu gly pro asp arg glu his 
2521/841 

ctt cat ccc ggc cga gcg ccg egg cac gcg 
leu his pro gly arg ala pro arg his ala 
2581/861 

gee tec ate age age cat ttc ggc ggc gtg 
ala ser ile ser ser his phe gly gly val 
2641/881 

agt cag gga ccc aat cag caa tga 
ser gin gly pro asn gin gln^OPA 



1891/631 

cag ggc acc ggc acc acc gee gag gtc tac 
gin gly thr gly thr thr ala glu val tyr 
1951/651 

cca gcg cag ccg cca aag ccg egg gta ttt 
pro ala gin pro pro lys pro arg val phe 
2011/671 

gcg gee gat ccg acg gac gtt ccc gec gec 
ala ala asp pro thr asp val pro ala ala 
2071/691 

ccg cgc cgt acc ccg ggg tec agt ggc ate 
pro arg arg thr pro gly ser ser gly ile 
2131/711 

egg egg cgc gag ctg aaa aca ccc tgg tgg 
arg arg arg glu leu lys thr pro trp "trp 
2191/731 

ccg ccc gca cca gaa ccg cga ccg gcg ccg 
pro pro ala pro glu pro arg pro ala pro 
2251/751 

ccg gtt gat gac gac gtc ate tac egg egg 
pro val asp asp asp val ile tyr atg arg 
2311/771 

gag ctg gee cac age ccc gat ctg gac tgg 
glu leu ala his ser pro asp leu asp t trp 
2371/791 

gee gee gag gec gcg gac aag ccc gtg cag 
ala ala glu ala ala asp lys pro val gin 
2431/811 

gaa ccc ggg gec egg tta gtg ccg ggg gcg 
glu pro gly ala arg leu val pro gly ala 
2491/831 

ccg ggt gca gcg eta gca tec aac ggc gga 
pro gly ala ala leu ala ser asn gly gly 
2551/851 

get gcg gta cgc gac ccc gac gcg gtt cgt 
ala ala val arg asp pro asp ala val arg 
2611/871 

cgc acc ggg egg teg cat gee cgc gag age 
arg thr gly arg ser his ala arg glu ser 



SEQ ID Mc&. -3frs( continued 2) 



FIGURE 21B (continued 2) 



87/185 



31/11 

dS>.t5*X/S<?CTA CGA CAA GGC AAA GGA GCA CAG GGT GAA GCG TGG ACT GAC GGT CGC GGT AGC CGG AGC 

eftto^qCTA ^ giy giu trp thr asp gly arg gly ser arg ser 

«»tt>*£>. W> /Ol 91/31 

CGC CAT TCT GGT CGC AGG TCT TTC CGG ATG TTC AAG CAA CAA GTC GAC TAC AGG AAG CGG 
arg his ser gly arg arg ser phe arg met P^^Y* 9 ln 3 ln val as P tyr arg lyS arg ) 

TGAGAC CAC GAC CGC GGC AGG CAC GAC GGC AAG CCC CGG CGC CGC ATC CGG GCC GAA GGT 

Spijfasp his asp arg gly arg his asp gly lys pro arg arg arg lie arg aia glu gly 
»iDH*.V9i -TfiTyei 211/71 

CGT CAT CGA CGG TAA GGA CCA GAA CGT CAC CGG GTC TGT GGT GTG CAC AAC CGC GGC CGG 

arg his arg arg) OCH (gly pro glu arg his arg val cys gly val his asn arg gly arg 

CAa'tGT CAA CAT t£ G^CG^CGG GGC GGC GAc'cGG CAT TGC CGC CGT GCT CAC CGA CGG 
Sn cys gin his arg asp arg arg gly gly asp arg his cys arg arg aia h 1S arg arg 

33X/111 

CAA 7 CCC TCC GGA GGT GAA GTC CGT TGG GCT CGG TAA CGT CAA CGG CGT CAC GCT GGG ATA 
gin pro ser gly gly glu val arg trp aia arg) OCH (arg gin arg arg his aia gly He 
?>£i /i ?i 39l/l31ae® it> no.v<?3 

CAC GTC GGG CAC CGG ACA GGG TAA CGC TCG GCA ACC AAG GAC GGC AGC CAC TAC AAG ATC 
nts val gly his arg thr gly)oCH £rg ser aia thr lys asp gly ser his tyr lys He/ 



SEQ ID <^ 
FIGURE 22A 



-**-~ & z £ - s s; ss £ - s s b - is s s s x s 

GCC ATT CTG GTC GCA GGT CTT TCC GGA TGT TCA AGC AAC AAG TCG ACT ACA GGA AGC GGT 
til Til llu Hi aia gly leu ser gly cys server asn lys ser thr thr gly ser gly 

GAG 7 ACC ACG ACC GCG GCA GGC ACG ACG GCA AGc'cCC GGC GCC GCA TCC GGG CCG AAG GTC 
glu ttr tS ^r aia aia gly thr thr aia ser^pro gly aia aia ser gly pro lys val 

GTC^TC GAC GGT AAG GAC CAG AAC GTC ACC GGg'tCT GTG GTG TGC ACA ACC GCG GCC GGC 
val He atp gly lys asp gin asn val thr gl^ser val val cys thr thr aia aia gly 

Iat'gTC AAC ATC GCG ATC GGC GGG GCG GCG ACC GGC ATT GCC GCC GTG CTC ACC GAC GGC 
Itn 111 atn tie aia ile gly gly aia aia thr^gly ile aia aia val leu thr asp gly 

^c'cCT CCG GAG GTG AAG TCC GTT GGG CTC GGt'aAC GTC AAC GGC GTC ACG CTG GGA TAC 
itn pro pro glu val lys ser val gly leu gly asn val asn gly val thr leu gly tyr 

ac^TCG CGC ACC GGA PAG GGT AAC GCT CGG cIa'cCA AGG ACG GCA GCC ACT ACA AGA TC_ 

Sr ser gly thr gly gin gly asn aia arg gin pro arg thr aia aia tnr cnr 

SEQ ID. Uo&. </<?5- 
FIGURE 22B 



88/185 



33/11 



/7D £> UO- W ACG ACA 


AGG 


CAA 


AGG 


AGC 


ACA 


GGG 


TGA 


AGC 


GTG GAC 


TGA 


CGG 


TCG 


CGG 


TAG 


CCG 


GAG 


CCG 




thr thr 


arg 


gin 


arg 


ser 


thr 


giy) 


OPA 


ser 


val asp 


OPA 


arg 


ser 


arg 


AMB / 


pro 


glu 


pro 


63/21 












93/31 














CCA TTC 


TGG 


TCG 


CAG 


GTC 


TTT 


CCG 


GAT 


GTT 


CAA GCA 


ACA 


AGT 


CGA 


CTA 


CAG 


GAA 


GCG 


GTG 




i™4 r*o nhp 
yi u 


trp 


ser 


gin 


val 


phe 


pro 


asp 


val 


gin ala 


thr 


ser 


arg 


leu 


gin 


glu 


ala 


val 




123/41 


















153/51 




















AGA CCA 


CGA 


CCG 


CGG 


CAG 


GCA 


CGA 


CGG 


CAA 


GCC CCG 


GCG 


CCG 


CAT 


CCG 


GGC 


CGA 


AGG 


TCG 






arg 


pro 


arg 


gin 


ala 


arg 


arg 


gin 


ala pro 


ala 


pro 


his 


pro 


gly 


arg 


arg 


ser 




183/61 
















213/71 




















TCA TCG 


ACG 


GTA 


AGG 


ACC 


AGA 


ACG 


TCA 


CCG 


GGT CTG 


TGG 


TGT 


GCA 


CAA 


CCG 


CGG 


CCG 


GCA 




ser ser 


thr 


val 


arg 


thr 


arg 


thr 


ser 


pro 


gly leu 


trp 


cys 


ala 


gin 


pro 


arg 


pro 


ala 




243/81 


















273/91 




















ATG TCA 


ACA 


TCG 


CGA 


TCG 


GCG 


GGG 


CGG 


CGA 


CCG GCA 


TTG 


CCG 


CCG 


TGC 


TCA 


CCG 


ACG 


GCA 




met ser 


thr 


ser 


arg 


ser 


ala 


gly 


arg 


arg 


pro ala 


leu 


pro 


pro 


cys 


ser 


pro 


thr 


ala 




303/101 


















333/111 




















ACC CTC 


CGG 


AGG 


TGA 


AGT 


CCG 


TTG 


GGC 


TCG 


GTA ACG 


TCA 


ACG 


GCG 


TCA 


CGC 


TGG 


GAT 


ACA 




thr leu 


arg 


arg 


)0PA 


/ser 


pro 


leu 


gly 


ser 


val thr 


ser 


thr 


ala 


ser 


arg 


trp 


asp 


thr 




363/121 












393/131 




















CGT CGG 


GCA 


CCG 


GAC 


AGG 


GTA 


ACG 


CTC 


GGC 


AAC CAA 


GGA 


CGG 


CAG 


CCA 


CTA 


CAA 


GAT 


c 




arg arg 


ala 


pro 


asp 


arg 


val 


thr 


leu 


gly 


asn gin 


gly 


arg 


gin 


pro 


leu 


gin 


asp 


) 



SEQ ID K£0.t«7-5£0 



FIGURE 22C 



31/11 

e$ lfcAo. GCA CAA CCG CGG CCG GCA ATG TCA ACA TCG CGA TCG GCG GGG CGG CGA CCG GCA TTG CCG 
/ala gin pro arg pro ala met ser thr ser arg ser ala gly arg arg pro ala leu pro 

t£> to«o.6D2 ^ 61 /2i 91/31 

CCG TGC TCA CCG ACG GCA ACC CTC CGG AGG TGA AGT CCG TTG GGC TCG GTA ACG TCA ACG 
pro cys ser pro thr ala thr leu arg arg) OPA (ser pro leu gly ser val thr ser thr 
121/41 151/51 %£& lO VLo- *fig> 

GCG TCA CGC TGG GAT ACA CGT CGG GCA CCG GAC AGG GTA ACG CCT CGG CAA CCA AGG ACG 
ala ser arg trp asp thr arg arg ala pro asp arg val thr pro arg gin pro arg thr 
181/61 211/71 

GCA GCC ACT ACA AGA TCA CAG GGT GAA GCG TGG ACT GAC GGT CGC GGT AGC CGG AGC CGC 
ala ala thr thr arg ser gin gly glu ala trp thr asp gly arg gly ser arg ser arg 
241/81 271/91 

CAT TCT GGT CGC AGG TCT TTC CGG ATG TTC AAG CAA CAA GTC GAC TAC AGG AAG CGG TGA 
his ser gly arg arg ser phe arg met phe lys gin gin val asp tyr arg lys arg) OPA 
301/101 331/111 

GAC CAC GAC CGC GGC AGG CAC GAC GGC AAG CCC CGG CGC CGC TCC GGG CCG AAG GTC GTC 

asnD^ew-i^p his asp arg gly arg his asp gly lys pro arg arg arg ser gly pro lys vai val 

361/121 391/131 

ATC GAC GGT AAG GAC CAG AAC GTC ACC GGC TCC GTG GTG TGC ACA ACC GCG. GCC GGC AAT 
ile asp gly lys asp gin asn val thr gly ser val val cys thr thr ala ala gly asn 
421/141 451/151 

GTC AAC A TC GC G A TC GGC GGG GCG G CG ACC GGC ATT GCC GCC GTG CTC ACC GAC GGC AAC 

val asn ile ala ile gly gly ala ala thr gly ile ala ala val leu thr asp gly asn 
481/161 511/171 

CCT CCG GAG GTG AAG TCC GTT GGG CTC GGT AAC GTC AAC GGC GTC ACG CTG GGA TAC ACG 
pro pro glu val lys ser val gly leu gly asn val asn gly val thr leu gly tyr thr 
541/181 571/191 

TCG GGC ACC GGA CAG GGT AAC GCC TCG GCA ACC AAG GAC GGC AGC CAC TAC AAG ATC 
ser gly thr gly gin gly asn ala ser ala thr lys asp gly ser his tyr lys ile) 

SEQ ID lA£o.60/-a^/ 



FIGURE 23A 



89/185 



32/11 



S> it> £*££AC AAC 


CGC 


GGC 


CGG 


CAA 


TGT 


CAA 


CAT 


CGC 


GAT CGG 


CGG 


GGC 


GGC 


GAC 


CGG 


CAT 


TGC 


CGC 


w 'UWX 62/21 


arg 


gly 


arg 


gin 


cys 


gin 


his 


arg 


asp arg 
92/31 


arg 


gly gly 


asp 


arg 


his 


cys 


arg 


CGT GCT 


CAC 


CGA 


CGG 


CAA 


CCC 


TCC 


GGA 


GGT 


GAA GTC 


CGT 


TGG 


GCT 


CGG 


TAA 


CGT 


CAA 


CGG 


arg ala 


his 


arg 


arg 


gin 


pro 


ser 


gly 


gly 


glu val 


arg 


trp 


ala 


arg) 


OCH/ 


arg 


gin 


arg 


122/41 


















152/51 












CGT CAC 


GCT 


GGG 


ATA 


CAC 


GTC 


GGG 


CAC 


CGG 


ACA GGG 


TAA 


CGC 


CTC 


GGC 


AAC 


CAA 


GGA 


CGG 


arg his 


ala 


gly 


ile 


his 


val 


gly 


his 


arg 


thr gly} 


OCH 


(arg 


leu 


gly 


asn 


gin 


gly 


arg 


182/61 


















212/71 
















CAG CCA 


CTA 


CAA 


GAT 


CAC 


AGG 


GTG 


AAG 


CGT 


GGA CTG 


ACG 


GTC 


GCG 


GTA 


GCC 


GGA 


GCC 


GCC 


gin pro 


leu 


gin 


asp 


his 


arg 


val 


lys 


arg 


gly leu 


thr 


val 


ala. 


val 


ala 


gly 


ala 


ala 


242/81 


















272/91 


















ATT CTG 


GTC 


GCA 


GGT 


CTT 


TCC 


GGA 


TGT 


TCA 


AGC AAC 


AAG 


TCG 


ACT 


ACA 


GGA 


AGC 


GGT 


GAG 


ile leu 


val 


ala 


gly 


leu 


ser 


gly 


cys 


ser 


ser asn 


lys 


ser 


thr 


thr 


gly 


ser 


gly 


glu 


302/101 


















332/111 


















ACC ACG 


ACC 


GCG 


GCA 


GGC 


ACG 


ACG 


GCA 


AGC 


CCC GGC 


GCC 


GCT 


CCG 


GGC 


CGA 


AGG 


TCG 


TCA 


thr thr 


thr 


ala 


ala 


gly 


thr 


thr 


ala 


ser 


pro gly 


ala 


ala 


pro 


gly 


arg 


arg 


ser 


ser 


362/121 


















392/131 


















TCG ACG 


GTA 


AGG 


ACC 


AGA 


ACG 


TCA 


CCG 


GCT 


CCG TGG 


TGT 


GCA 


CATV 


CCG 


CGG 


CCG 


GCA 


ATG 


ser thr 


val 


arg 


thr 


arg 


thr 


ser 


pro 


ala 


pro trp 


cys 


ala 


gin 


pro 


arg 


pro 


ala 


met 


422/141 


















452/151 


















TCA ACA 


TCG 


CGA 


TCG 


GCG 


GGG 


CGG 


CGA 


CCG 


GCA TTG 


CCG 


CCG 


TGC 


TCA 


CCG 


ACG 


GCA 


ACC 


ser thr 


ser 


arg 


ser 


ala 


gly 


arg 


arg 


pro 


ala leu 


pro 


pro 


cys 


ser 


pro 


thr 


ala 


thr 


482/161 


















512/171 


















CTC CGG 


AGG 


TGA 


AGT 


CCG 


TTG 


GGC 


TCG 


GTA 


ACG TCA 


ACG 


GCG 


TCA 


CGC 


TGG 


GAT 


ACA 


CGT 


J 1 leu arg 


arg 


Wpa 


( ser 


pro 


' leu 


gly 


ser' 


val 


thr ser 


thr 


ala 


ser: 


arg 


trp 


asp 


thr 


arg 


542/181 












572/191 


















CGG GCA 


CCG 


GAC 


AGG 


GTA 


ACG 


CCT 


CGG 


CAA 


CCA AGG 


ACG 


GCA 


GCC 


ACT 


ACA 


AGA 


TC 




arg ala 


pro 


asp 


arg 


val 


thr 


pro 


arg 


gin 


pro arg 


thr 


ala 


ala 


thr 


thr 


arg 







SEQ ID lAc<5. 606 -60fl 
FIGURE 23B 



90/185 



33/11 

:©iDKo.&lO A CA ACC GCG GCC GGC AAT GTC AAC ATC GCG ATC GGC GGG GCG GCG ACC GGC ATT GCC GCC 

f thr thr ala ala 9 x y asn val asn ile ala ile gly s ly ala ala thr siy ile ala ala 

63/21 93/31 

GTG CTC ACC GAC GGC AAC CCT CCG GAG GTG AAG TCC GTT GGG CTC GGT AAC GTC AAC GGC 
val leu thr asp gly asn pro pro glu val lys ser val gly leu gly asn val asn gly 
123/41 153/51 

GTC ACG CTG GGA TAC ACG TCG GGC ACC GGA CAG GGT AAC GCC TCG GCA ACC AAG GAC GGC 
val thr leu gly tyr thr ser gly thr gly gin gly asn ala ser ala thr lys asp gly 
183/61 213/71 

AGC CAC TAC AAG ATC ACA GGG TGA AGC GTG GAC TGA CGG TCG CGG TAG CCG CAG CCG CCA 
ser his tyr lys ile thr gly) OPA ser val asp OPA arg ser arg AMB (pro glu pro pro 
243/81 273/91 \£> Z 

TTC TGG TCG CAG GTC TTT CCG GAT GTT CAA GCA ACA AGT CGA CTA CAG GAA GCG GTG AGA 
phe trp ser gin val phe pro asp val gin ala thr ser arg leu gin glu ala val arg 
303/101 333/111 

CCA CGA CCG CGG CAG GCA CGA CGG CAA GCC CCG GCG CCG CTC CGG GCC GAA GGT CGT CAT 
pro arg pro arg gin ala arg arg gin ala pro ala pro leu arg ala glu gly arg his 
363/121 393/131 

CGA CGG TAA GGA CCA GAA CGT CAC CGG CTC CGT GGT GTG CAC AAC CGC GGC CGG CAA TGT 
arg arg)oCH^gly pro glu arg his arg leu arg gly val his asn arg gly arg gin cys 
423/141 Se©lDKo.6i3 453/151 

CAA CAT CGC GAT CGG CGG GGC GGC GAC CGG CAT TGC CGC CGT GCT CAC CGA CGG CAA CCC 
gin his arg asp arg arg gly gly asp arg his cys arg arg ala his arg arg gin pro 

. . ,. • 483/161 . . 513/171 , ..... . 

TCC GGA GGT GAA GTC CGT TGG GCT CGG TAA CGT CAA CGG CGT CAC GCT GGG ATA CAC GTC 
ser gly gly glu val arg trp ala arg)oCH(arg gin arg arg his ala gly ile his val 
54 3/181 Se#i&*,5* 573/191 

GGG CAC CGG ACA GGG TAA CGC CTC GGC AAC CAA GGA CGG CAG CCA CTA CAA GAT C 
gly his arg thr gly) OCH /arg leu gly dsn gin gly arg gin pro leu gin asp) 



SEQ ID fco&.5/a-6/5 
FIGURE 23C 



91/185 



l(S> to*©.** (cCT A ACG AC A 
2^ i£> Ko.5iT leu thr thr 

61/21 

CGC CAT TCT 
arg his ser 
121/41 
TGA GAC CAC 
res rt> to.fti8^QBa/asp his 
181/61 
GTC ATC GAC 
val ile asp 
241/81 
AAT GTC AAC 
asn val asn 
301/101 
AAC CCT CCG 
asn pro pro 
361/121 
ACG TCG GGC 
thr ser gly 



GGC AAA GGA GCA CAG GGT 
gly lys gly ala gin gly 

GGT CGC AGG TCT TTC CGG 
gly arg arg ser phe arg 

GAC CGC GGC AGG CAC GAC 
asp arg gly arg his asp 

GGT AAG GAC CAG AAC GTC 
gly lys asp gin asn val 

ATC GCG ATC GGC GGG GCG 
ile ala ile gly gly ala 

GAG GTG AAG TCC GTT GGG 
glu val lys ser val gly 

ACC GGA CAG GGT AAC GCC 
thr gly gin gly asn ala 



31/11 

GAA GCG TGG ACT GAC GGT 
glu ala trp thr asp gly 
91/31 

ATG TTC AAG CAA CAA GTC 
met phe lys gin gin val 
151/51 

GGC AAG CCC CGG CGC CGC 
gly lys pro arg arg arg 
211/71 

ACC GGC TCC GTG GTG TGC 
thr gly ser val val cys 
271/91 

GCG ACC GGC ATT GCC GCC 
ala thr gly ile ala ala. 

331/111 
CTC GGT AAC GTC AAC GGC 
leu gly asn val asn gly 

391/131 
TCG GCA ACC AAG GAC GGC 
ser ala thr lys asp gly 



CGC GGT AGC CGG AGC 
arg gly ser arg ser 

GAC TAC AGG AAG CGG 
asp tyr arg lys argj 

TCC GGG CCG AAG GTC 
ser gly pro lys val 

ACA ACC GCG GCC GGC 
thr thr ala ala gly 

GTG CTC ACC GAC GGC 
val leu thr asp gly 

GTC ACG CTG GGA TAC 
val thr leu gly tyr 

AGC CAC TAC AAG ATC 
ser his tyr lys ile^ 



SEQ ID lfc£>. 5**- 
FIGURE 24A 



32/11 

£&> iV>Ht>^ TAA CGA CAG GCA AAG GAG CAC AGG GTG AAG CGT GGA CTG ACG GTC GCG GTA. GCC GGA GCC 
OCHYarg gin ala lys glu his arg val lys arg gly leu thr val ala val ala gly ala 

6> li^Ko. 620 62/21 92/31 

GCC ATT CTG GTC GCA GGT CTT TCC GGA TGT TCA AGC AAC AAG TCG ACT ACA GGA AGC GGT 
ala ile leu val ala gly leu ser gly cys ser ser asn lys ser thr thr gly ser gly 
122/41 152/51 

GAG ACC ACG ACC GCG GCA GGC ACG ACG GCA AGC CCC GGC GCC GCT CCG GGC CGA AGG TCG 
glu thr thr thr ala ala gly thr thr ala ser pro gly ala ala pro gly arg arg ser 
182/61 212/71 

TCA TCG ACG GTA AGG ACC AGA ACG TCA CCG GCT CCG TGG TGT GCA CAA CCG CGG CCG GCA 
ser ser thr val arg thr arg thr ser pro ala pro trp cys ala gin pro arg pro ala 
242/81 272/91 

ATG TCA ACA TCG CGA TCG GCG GGG CGG CGA CCG GCA TTG CCG CCG TGC TCA CCG ACG GCA 
met ser thr ser arg ser ala gly arg arg pro ala leu pro pro cys ser pro thr ala 
302/101 332/111 

ACC CTC CGG AGG TGA AGT CCG TTG GGC TCG GTA ACG TCA ACG GCG TCA CGC TGG GAT ACA 
thr leu arg arg^OPA /ser pro leu gly ser val thr ser thr ala ser arg trp asp thr 
36 ^ /121 «SecS 392/131 



CGT CGG GCA CCG GAC AGG GTA ACG CCT CGG CAA CCA AGG ACG GCA GCC ACT ACA AGA TC 
arg arg ala pro asp arg val thr pro arg gin pro arg thr ala ala thr thr argj 



SEQ ID )fo.f>iq*£>2l 
FIGURE 24B 



92/185 



33/11 

KS> ifc>Kt>*&22 AAC GAC AGG CAA AGG AGC ACA GGG TGA AGC GTG GAC TGA CGG TCG CGG TAG CCG GAG CCG 
ic£> u> Ko. 523 (asn asp arg gin arg ser thr gly)oPA ser val asp OPA arg ser arg AMB (pro glu pro 
63/21 93/31 Se6.iDKo.e24 

CCA TTC TGG TCG CAG GTC TTT CCG GAT GTT CAA GCA ACA AGT CGA CTA CAG GAA. GCG GTG 
pro phe trp ser gin val phe pro asp val gin ala thr ser arg leu gin glu ala val 
123/41 153/51 

AGA CCA CGA CCG CGG CAG GCA CGA CGG CAA GCC CCG GCG CCG CTC CGG GCC GAA GGT CGT 
arg pro arg pro arg gin ala arg arg gin ala pro ala pro leu arg ala glu gly arg 
183/61 213/71 

CAT CGA CGG TAA GGA CCA GAA CGT CAC CGG CTC CGT GGT GTG CAC AAC CGC GGC CGG CAA 
his arg arg\ OCH (gly pro glu arg his arg leu arg gly val his asn arg gly arg gin 
243/81 secs> »t> 273/91 

TGT CAA CAT CGC GAT CGG CGG GGC GGC GAC CGG CAT TGC CGC CGT GCT CAC CGA CGG CAA 
cys gin his arg asp arg arg gly qly asp arg his cys arg arg ala his arg arg gin 
303/101 333/111 

CCC TCC GGA GGT GAA GTC CGT TGG GCT CGG TAA CGT CAA CGG CGT CAC GCT GGG ATA CAC 
pro ser gly gly glu val arg trp ala arg) OCH (arg gin arg arg his ala gly ile his 
363/121 ' 393/131 iD l*o. €>2<p 

GTC GGG CAC CGG ACA GGG TAA CGC CTC GGC AAC CAA GGA CGG CAG CCA CTA CAA GAT C 
val gly his arg thr gly\oCH(arg leu gly asn gin gly arg gin pro leu gin -asp\ 

SEQ ID {Coe>. 522-5^7 
FIGURE 24C 



Direct primer 
tL> MX>- £>2*S 5' ACG CGG CGC AGC CTG TTG 3' 

SEQ ID fto . 
FIGURE 25 

Reverse primer 
-56<S> *fc*o. 62? 5 1 CGA CCT TGG GAT TCG CCT 3» 



SEQ ID 

FIGURE 26 



93/185 



31/11 

^ |£>A©. Z&D CCT ACC AGC AAG AGC CCA GGG CTT CAC AGG ACC TAA AAG GAG TAG CGC CCA TGG GCT TGA 

iC>*<£>. £31 pro thr ser lys ser pro gly leu his arg thAoCH lys glu AMB^arg pro trp ala) OPA 

61/21 91/31 Kd. 6Si 

TCC AAT TTT CCT TCC GCC CCG TGC AAT ACC ATC TGC AAG ACC AGC GAC GGC CCG TGG TTG 

(ser asn phe pro ser ala pro cys asn thr ile cys lys thr ser asp gly pro trp leu 

2S>G>U>..S33 121/41 151/51 

CGG TCG CGC AGC TTG CGG AAA CGG GGT ATG GAC CCT GCC GTA CCG TTG TTG CCA CTT GAT 

arg ser arg ser leu arg lys arg gly met asp pro ala val pro leu leu pro leu asp 

181/61 211/71 

GTC GTC GCT CTC CAC CCG TCG GGG GGC GAA AGC CAT TCC GAC ACT GGG ATC CTC AAA ACG 

val val ala leu his pro ser gly gly glu ser his ser asp thr gly ile leu lys thr 

241/81 271/91 

TCG GCT GAG TGT CTG CAG GGC TCC GGG GAG CAG CCG ATC ATC ACC ATG TAC GAA CTG AAT 

ser ala glu cys leu gin gly ser gly glu gin pro ile ile thr met tyr glu leu asn 

301/101 331/111 

AAG TCC CCC CCG CGC GAC TTC CAG ACA TTT GTT GTG GTT TCG GTT GAG GCC GAG GCG AGG 

lys ser pro pro arg asp phe gin thr phe val val val ser val glu ala glu ala arg 

361/121 391/131 

CTC ATT TCG CAG CAA GCG GTC TCC GGG TCG CAG CAT CGT TGC GGC GAT CGC GGC GCA GTC 

leu ile ser gin gin ala val ser gly ser gin his arg cys gly asp arg gly ala val 
421/141 

GTC GGA CGA GTC GTC GTC AAC GAC CAC GAT C 

val gly arg val val val asn asp his asp) 



SEQ ID fco=>.£i3C>-533 
FIGURE 27A 



31/11 

Ifi> £>(to. e3H CTA CCA GCA AGA GCC CAG GGC TTC ACA GGA CCT AAA AGG AGT AGC GCC CAT GGG CTT GAT 

G>UjoS35 leu pro ala arg ala gln gly pne tnr gly pr ° lyS arg Ser Ser ala niS gly leU aSp 
^* 61/21 91/31 

CCA ATT TTC CTT CCG CCC CGT GCA ATA CCA TCT GCA AGA CCA GCG ACG GCC CGT GGT TGC 

pro ile phe leu pro pro arg ala ile pro ser ala arg pro ala thr ala arg gly cys 

121/41 151/51 

GGT CGC GCA GCT TGC GGA AAC GGG GTA TGG ACC CTG CCG TAC CGT TGT TGC CAC TTG ATG 
gly arg ala ala cys gly asn gly val trp thr leu pro tyr arg cys cys his leu met 
181/61 211/71 

TCG TCG CTC TCC ACC CGT CGG GGG GCG AAA GCC ATT CCG ACA CTG GGA TCC TCA AAA CGT 
ser ser leu ser thr arg arg gly ala lys ala ile pro thr leu gly ser ser lys arg 
241/81 271/91 

CGG CTG AGT GTC TGC AGG GCT CCG GGG AGC AGC CGA TCA TCA CCA TGT ACG AAC TGA ATA 
arg leu ser val cys arg ala pro gly ser ser arg ser ser pro cys thr asn^ OPACile 
301/101 331/111 :S£<^ to Ko. 63Cp 

AGT CCC CCC CGC GCG ACT TCC AGA CAT TTG TTG TGG TTT CGG TTG AGG CCG AGG CGA GGC 

ser pro p ro~ arrg - a"l~a~thr~ ser~ arg— hl~s~l-eu~rexi^xp— plie^xg—l~exi— axg pro ary dry gl-y - 

361/121 391/131 

TCA TTT CGC AGC AAG CGG TCT CCG GGT CGC AGC ATC GTT GCG GCG ATC GCG GCG CAG TCG 
ser phe arg ser lys arg ser pro gly arg ser ile val ala ala ile ala ala gin ser 
421/141 

TCG GAC GAG TCG TCG TCA ACG ACC ACG ATC 
ser asp glu ser ser ser thr thr thr ile) 



SEQ ID Ka&.63V-5<36* 
FIGURE 27B 



94/185 



33/11 



TAC CAG 


CAA 


GAG 


CCC 


AGG 


GCT 


TCA 


CAG 


GAC 


CTA AAA 


GGA 


GTA 


GCG 


CCC 


ATG 


GGC 


TTG 


ATC 


tyr gin 


gin 


glu 


pro 


arg 


ala 


ser 


gin 


asp 


leu lys 


gly 


val 


ala 


pro 


met 


gly 


leu 


ile 


63/21 


















93/31 


















CAA TTT 


TCC 


TTC 


CGC 


CCC 


GTG 


CAA 


TAC 


CAT 


CTG CAA 


GAC 


CAG 


CGA 


CGG 


CCC 


GTG 


GTT 


GCG 


gin phe 


ser 


phe 


arg 


pro 


val 


gin 


tyr 


his 


leu gin 


asp 


gin 


arg 


arg 


pro 


val 


val 


ala 


123/41 


















153/51 


















GTC GCG 


CAG 


CTT 


GCG 


GAA 


ACG 


GGG 


TAT 


GGA 


CCC TGC 


CGT 


ACC 


GTT 


GTT 


GCC 


ACT 


TGA 


TGT 


val ala 


gin 


leu 


ala 


glu 


thr 


gly 


tyr 


gly 


pro cys 


arg 


thr 


val 


val 


ala 


thr)OPA^cys 


183/61 


















213/71 














CGT CGC 


TCT 


CCA 


CCC 


GTC 


GGG 


GGG 


CGA 


AAG 


CCA TTC 


CGA 


CAC 


TGG 


GAT 


CCT 


CAA 


AAC 


GTC 


arg arg 


ser 


pro 


pro 


val 


gly 


gly 


arg 


lys 


pro phe 


arg 


his 


trp 


asp 


pro 


gin 


asn 


val 


243/81 


















273/91 


















GGC TGA 


GTG 


TCT 


GCA 


GGG 


CTC 


CGG 


GGA 


GCA 


GCC GAT 


CAT 


CAC 


CAT 


GTA 


CGA 


ACT 


GAA 


TAA 


gly) OPA 


(val 


ser 


ala 


gly 


leu 


arg 


gly 


ala 


ala asp 


his 


his 


his 


val 


arg 


thr 


glu) 


OCH 


303/101 












333/111 
















GTC CCC 


CCC 


GCG 


CGA 


CTT 


CCA 


GAC 


ATT 


TGT 


TGT GGT 


TTC 


GGT 


TGA 


GGC 


CGA 


GGC 


GAG 


GCT 


{yal pro 


pro 


ala 


arg 


leu 


pro 


asp 


ile 


cys 


cys gly 


phe 


giy) 


OPA 


(giy 


arg 


gly 


glu 


ala 


363/121 


















393/131 








ID *o. ev£ 




CAT TTC 


GCA 


GCA 


AGC 


GGT 


CTC 


CGG 


GTC 


GCA 


GCA TCG 


TTG 


CGG 


CGA 


TCG 


CGG 


CGC 


AGT 


CGT 


his phe 


ala 


ala 


ser 


gly 


leu 


arg 


val 


ala 


ala ser 


leu 


arg 


arg 


ser 


arg 


arg 


ser 


arg 


423/141 




































CGG ACG 


AGT 


CGT 


CGT 


CAA 


CGA 


CCA 


CGA 


TC 




















arg thr 


ser 


arg 


arg 


gin 


arg 


pro 


arg) 























SEQ ID 637' 
FIGURE 27C 



tseCD lOtU>. MKT GT AT T RRRL LAVL I ALAL P GAAVALLAE P SAT GAS D P CAAS EVAR 

TVGSVAKSMGDYLDSHPETNQVMTAVLQQQVGPGSVASLKAHFEANPK 
VAS D L HAL S Q P LT D L S T RC S L P I S GLQAI G LMQAVQGARR 



SEQ ID fto. £></3 
FIGURE 28 



\vD^.^V GTGGGCAAGC AGCTAGCCGC GCTCGCCGCG CTGGTCGGTG CGTGCATGCT CGCAGCCGGA 60 

TGCACCAACG TGGTCGACGG GAC CGC CGT G GCTGCCGACA AATCCGGACC ACT GC AT CAG 12 0 

GAT C C GAT AC CGGTTTCAGC GCTTGAAGGG CTGCTTCTCG ACTTGAGCCA GAT CAAT GCC 18 0 

GCGCTGGGTG C GAC AT C GAT GAAGGTGTGG TTCAACGCCA AGGCAATGTG GGACT GGAGC 24 0 

AAGAGCGTGG CCGACAAGAA TTGCCTGGCT ATC GAC GGT C CAGCACAGGA AAAGGTCTAT 300 

GCCGGCACCG -GGT£GACCGC_TAT.G.CGCG^^ Sfifl. 

AAGAAAC GC G ACCACTACGC CAT T CAAGC G GTCGTCGGCT TCCCGACCGC AC AT GAT GC C 420 

GAGGAGTTCT ACAGCTCCTC GGTGCAAAGC TGGAGCAGCT GCTCGAACCG CCGGTTTGTC 4 80 

GAAGTCACCC C C G GAC AGG A CGACGCCGCC TGGACTGTGG CTGACGTTGT CAACGACAAC 54 0 

GGCATGCTCA GTAGCTCGCA GGTTCAGGAA GGC GGC GAC G GATGGACCTG CCAGCGTGCC 600 

CTGACTGCGC GCAACAACGT C ACT AT C GAC ATT GT CAC GT GCGCCTATAG CCAACCGGAT 660 

TTGGTGGCGA TTGGCATCGC TAACCAAATC GCGGCCAAGG TTGCTAAGCA GTAG 714 



SEQ ID fiU, 6V</ 
FIGURE 29 



95/185 



>e<d ii^> l^c. €><73 MGKQLAAl*AALVGACMIAAGCThT\A/T)GTAVA 
MKWFNAKAMWDWS KS VADKNCIAI DG PAQ 

GFPTAHDAEEFYSSSVQSWSSCSNRRFVEVTFTPGQDDAAWTVADVWDNGMLSSSQVQEGGDGWTCQ 
RALT ARNNVT I D I VT CAY S Q P D LVFT AI G I ANQ I AAKVAKQ 



SEQ ID Kr>.6<4£> 
FIGURE 30 



31/11 

GCG AGG GCA GCG CGA CGG CGG CCC TGC CGG CGC CGT GGC TGC TGA ACA 
ala arg ala ala arg arg arg pro cys arg arg arg gly cys) OPA^thr 

91/31 se<S> /& . 

CGC GCA CGC TTC CGG TAT GCG GCA GGA TAA ACG ACC CCA ACA GCA CGA 
arg ala arg phe arg tyr ala ala gly)oCH(thr thr pro thr ala arg 

151/51 sa£> 
CGA CAA CCA AAG CCC TCG CGC CTG GCT CGA TTT CGC GCG CAA CGC GGC 
arg gin pro lys pro ser arg leu ala arg phe arg ala gin arg gly 

211/71 

CGA TCT CAG CGC GGA GGG CGT CGA GAT C 
arg ser gin arg gly gly arg arg asp^ 

SEQ ID 

FIGURE 31A 



1/1 


















31/11 


















cDlb^e60GGC GAA 


TAC 


CCG 


CGA 


GGG 


CAG 


CGC 


GAC 


GGC 


GGC CCT 


GCC 


GGC 


GCC 


GTG 


GCT 


GCT 


GAA 


CAA 


it> iA-o.es/ giy giu 

61/21 


tyr 


pro 


arg 


gly 


gin 


arg 


asp 


gly 


gly pro 


ala 


gly 


ala 


val 


ala 


ala 


glu 


gin 


















91/31 


















CAC ATC 


CCA 


GCC 


GCG 


CAC 


GCT 


TCC 


GGT 


ATG 


CGG CAG 


GAT 


AAA 


CGA 


CCC 


CAA 


CAG 


CAC 


GAA 


his ile 


pro 


ala 


ala 


his 


ala 


ser 


gly 


met 


arg gin 


asp 


lys 


arg 


pro 


gin 


gin 


his 


glu 


121/41 


















151/51 


















CAC CAG 


GAT 


TGC 


GAC 


AAC 


CAA 


AGC 


CCT 


CGC 


GCC TGG 


CTC 


GAT 


TTC 


GCG 


CGC 


AAC 


GCG 


GCG 


his gin 


asp 


cys 


asp 


asn 


gin 


ser 


pro 


arg 


ala trp 


leu 


asp 


phe 


ala 


arg 


asn 


ala 


ala 


181/61 


















211/71 


















TTC TGC 


CGC 


CTC 


GAT 


CTC 


AGC 


GCG 


GAG 


GGC 


GTC GAG 


ATC 
















phe cys 


arg 


leu 


asp 


leu 


ser 


ala 


glu 


gly 


val glu 


ile | 


) 















SEQ ID tU3i>. eoO-SbJ 



1/1 

SO to *o. ©V<p AGG CGA ATA CCC 

lb ^£N7 ( ar S ar S ile P ro 
61/21 

ACA CAT CCC AGC 
thr his pro ser 
121/41 

ACA CCA GGA TTG 
thr pro gly leu 
181/61 

., ,., GTT ,CTG CCG fl CCT 

val leu pro pro 



FIGURE 31B 



1/1 



l& l£>A*>.e£>2. GCG 




cnc 






AGC 


/— 1^ r~ 


TS.CC 


4la asn 


thr 


arg 


glu 


gly 


ser 


ala 


thr 










ACA TCC 


CAG 


CCG 


CGC 


ACG 


CTT 


CCG 


GTA 


thr ser 


gin 


pro 


arg 


thr 


leu 


pro 


val 


121/41 
















ACC AGG 


ATT 


GCG 


ACA 


ACC 


AAA 


GCC 


CTC 


thr arg 


ile 


ala 


thr 


thr 


lys 


ala 


leu 


. 181/61 
















TCT GCC 


GCC 


TCG 


ATC 


TCA 


GCG 


CGG 


AGG 


ser ala 


ala 


ser 


ile 


ser 


ala 


arg 


arg 



96/185 





11 /l 1 






















CCG 


GCG 


CCG 




Liu 


CTG 


AAC 


AAC 


ala 


ala leu 
91/31 


pro 


ala 


pro 




leu 


leu 


asn 


asn 


TGC 


GGC AGG 


ATA 


AAC 


GAC 


CCC 


AAC 


AGC 


ACG 


AAC 


cys 


gly arg 
151/51 


ile 


asn 


asp 


pro 


asn 


ser 


thr 


asn 


GCG 


CCT GGC 


TCG 


ATT 


TCG 


CGC 


GCA 


ACG 


CGG 


CGT 


ala 


pro gly 
211/71 


ser 


ile 


ser 


arg 


ala 


thr 


arg 


arg 


GCG 


TCG AGA 


TC 
















ala 


ser arg % 



















SEQ ID Kr*. 562-563 
FIGURE 31C 



ORF according to Cole et al . (Nature 393:537-544) and containing seq31A 
1/1 31/11 

e&io*i>. taa ac< ? acc cca aca gca cga aca cca gga ttg cga caa cca aag ccc tcg cgc ctg gct 

^ , OCHJthr thr pro thr ala arg thr pro gly leu arg gin pro lys pro ser arg leu ala 

<lq fc*o. 6e£>^j^ 1 F 91/31 

cga ttt cgc gcg caa cgc ggc gtt ctg ccg cct cga tct cag cgc gga ggg cgt cga gat . 
arg phe arg ala gin arg gly val leu pro pro arg ser gin arg gly gly arg arg asp 
121/41 151/51 

ccc egg cgt cgt gtt cgt ggc tea tea tct gca tec tec ggg ctt ggc cgc gct gac egg 
pro arg arg arg val arg gly ser ser ser ala ser ser gly leu gly arg ala asp arg 
' ' 181/61 ' 211/71 

cag ccc gac ccc agg cat gec cag gee gac ggc gcg ccc egg ctg ccc ggc ggt gtg cgc 
gin pro asp pro arg his ala gin ala asp gly ala pro arg leu pro gly gly val arg 
241/81 271/91 

gtc gee ggc gcg ggt gcg gcg gtg ggt cag gac gee ggc gtc ggc gat gag gtg gtg egg 
val ala gly ala gly ala ala val gly gin asp ala gly val gly asp glu val val arg 
301/101 331/111 

cgc cgc ttc ggt gac ctt cgt ggt gat gac gtc gee ggg acg cac gcg egg ctg gee ggc 
arg arg phe gly asp leu arg gly asp asp val ala gly thr his ala arg leu ala gly 
361/121 391/131 

ggt gaa gtg cac cag gcg ccc gtc gcg cgc ccg ccc gct cat gcg cgc cgt gac ggt gtc 
gly glu val his gin ala pro val ala arg pro pro ala his ala arg arg asp gly val 
421/141 451/151 

ctt gcg ccc ttc ccc ggt ggc cac cag cac etc gac ggc ctg ccc gac cag ggc gcg gtt 
leu ala pro phe pro gly gly his gin his leu asp gly leu pro asp gin gly ala val 
481/161 511/171 

ggc ttc cag cga gat ttg etc ctg cag cgc gat cag gcg ttc ata gcg ttc ctg cac aac 
gly phe gin arg asp leu leu leu gin arg asp gin ala phe ile ala phe leu his asn 
541/181 571/191 

ggc ttt egg cag ctg tec gtc gag ttg cgc ggc egg tgt ccc ggg ccg ctt gga gta ttg 
gly phe arg gin leu ser val glu leu arg gly arg cys pro gly pro leu gly val leu 
601/201 631/211 

gaa ggt aaa tgc ggc cgc gaa gcg ggc ccg gcg cac cac gtc gag cgt ggc cgc gaa gtc 
glu gly lys cys gly arg glu ala gly pro ala his his val glu arg gly arg glu val 
661/221 691/231 

etc ttc ggt etc ccc ggg gaa acc gac gat cag ate ggt ggt aat cgc ggc atg egg gat 
leu phe gly leu "pro gly glu thr asp ^asp gin iTe" gly gly asn arg gly "trier aTg" asp 
721/241 751/251 

ggc cgc ccg cac gcg etc gat gat gee gag gta gcg etc ggc acg ata gga ccg ccg cat 
gly arg pro his ala leu asp asp ala glu val ala leu gly thr ile gly pro pro his 
781/261 811/271 
cgc gcg cag gat ccg gtc gga tec gga ctg tag 
arg ala gin asp pro val gly ser gly leu\AMB 



SEQ ID Kr*b. 56*7-565 
FIGURE 31; O 



97/185 



1/1 31/11 

it>lfo, SbCp aga ctg gtg tac acg gag acc aag ctg aac teg gca ttc tec ttc ggc ggg cct aag tgt 

9 <s> c> kx>. ^?&7 ar 9 leu va -^ t y r t ^ lr ttir iy s ^- eu asn ser a ^ a P he ser p^ e siy siy P ro iy s c y s 

61/21 91/31 

eta gtg aag gtc att cag aaa ctg teg ggc ttg age ate aac egg ttc ate gcg att gac 

leu val lys val ile gin lys leu ser gly leu ser ile asn arg phe ile ala ile asp 

121/41 151/51 

ttc gtc ggt ttc gcg egg atg gtc gag gee etc ggc ggc gtc gag gta tgc age acc acc 

phe val gly phe ala arg met val glu ala leu gly gly val glu val cys ser thr thr 

181/61 211/71 

ccg ttg egg gac tac gaa ctg ggc acg gtg ctg gag cac gee gga cgc cag gtc att gac 

pro leu arg asp tyr glu leu gly thr val leu glu his ala gly arg gin val ile asp 

241/81 271/91 

ggg ccg acc gcg ctg aac tat gtg cgc get cgc cag gtc acc acc gag age aat ggc gac 

gly pro thr ala leu asn tyr val arg ala arg gin val thr thr glu ser asn gly asp 

301/101 331/111 

tac ggg cgc ate aaa cgc cag cag ttg ttt ttg teg teg ctg ctg cgt teg atg ate 

tyr gly arg ile lys arg gin gin leu phe leu ser ser leu leu arg ser met ile) 



SEQ ID toe>.£6Co-eb7 
FIGURE 32A 



1/1 31/11 
£&) to ilo. ££& gac tgg tgt aca egg aga cca age tga act egg cat tct cct teg gcg ggc eta agt gtc 

r & fc> *5©? ^ asp tr P cys tnr arg arg pro ser ^ OPA fc hr ar 9 his ser P ro ser ala gly leu ser val\ 
61/21 ' 3ta>iDKf>£«<91/31- 

tag tga agg tea ttc aga aac tgt egg get tga gca tea acc ggt tea teg cga ttg act 
AMB OPA/arg ser phe arg asn cys arg ala\OPA (ala ser thr gly ser ser arg leu thr 
121/41 seSJ/l>iU>.^^ 7 151/51 S£G> /DA^6toZ 

teg teg gtt teg cgc gga tgg teg agg ccc teg gcg gcg teg agg tat gca gca cca ccc 
ser ser val ser arg gly trp ser arg pro ser ala ala ser arg tyr ala ala pro pro 
181/61 211/71 

cgt tgc ggg act acg aac tgg gca egg tgc tgg age acg ccg gac gee agg tea ttg acg 
arg cys gly thr thr asn trp ala arg cys trp ser thr pro asp ala arg ser leu thr 
241/81 271/91 

ggc cga ccg cgc tga act atg tgc gcg etc gee agg tea cca ccg aga gca atg gcg act 
gly arg pro arg) OPA (thr met cys ala leu ala arg ser pro pro arg ala met ala thr 
301/101 :5e<D/J>/fe>6Ge3 331/111 

acg ggc gca tea aac gee age agt tgt ttt tgt cgt cgc tgc tgc gtt cga tga tc 
thr gly ala ser asn ala ser ser cys phe cys arg arg cys cys val arg)0PA 



SEQ ID Jjios, 5b?- E>U3 
FIGURE 32B 



98/185 



l/i 

UbtoAo. 6<oC/ act ggt gta cac gga gac caa get gaa 

(thr gly val his gly asp gin ala glu 

& folA*x>.^kf) V 6i/21 

agt gaa ggt cat tea gaa act gtc ggg 

ser glu gly his ser glu thr val gly 
121/41 

cgt egg ttt cgc gcg gat ggt cga ggc 

arg arg phe arg ala asp gly arg gly 
181/61 

gtt gcg gga eta cga act ggg cac ggt 

val ala gly leu arg thr gly his gly 
241/81 

gee gac cgc get gaa eta tgt gcg cgc 

ala asp arg ala glu leu cys ala arg 
301/101 

egg gcg cat caa acg cca gca gtt gtt 

arg ala his gin thr pro ala val val 



31/11 

etc ggc att etc ctt egg egg gec taa gtg tct 
leu gly ile leu leu arg arg alaJoCH^val ser 

ctt gag cat caa ccg gtt cat cgc gat tga ctt 
leu glu his gin pro val his arg aspl OPA /leu 

151/51 25£<& ,bWo.e<©7 

cct egg egg cgt cga ggt atg cag cac cac ccc 
pro arg arg arg arg gly met gin his his pro 

211/71 

get gga gca cgc egg acg cca ggt cat tga egg 
ala gly ala arg arg thr pro gly his)OPA(arg 

271/91 SeaMfc\M>£&>S 
teg cca ggt cac cac cga gag caa tgg cga eta 
ser pro gly his his arg glu gin trp arg leu 

331/111 

ttt gtc gtc get get gcg ttc gat gat c 
phe val val ala ala ala phe asp asp) 



SEQ ID K^>.56?t/-£>6?Sr 
FIGURE 32C 



sequence Rv0822c predicted by Cole et al . (Nature 393:537-544) and containing seq 32A 
1/1 ' ' 31/11 



£&it)t<4*£fc9 atg agt 


gac 


ggc 


gag 


age 


gee 


gcg 


ccg 


tgg 


gca egg 


etc 


tec 


gag 


tea 


gca 


ttc 


ccc 


gat 


^ ^ , /ket ser 
^eda iXZiio.£>70 61/21 


asp 


gly 


glu 


ser 


ala 


ala 


pro 


trp 


ala arg 
91/31 


leu 


ser 


glu 


ser 


ala 


phe 


pro 


asp 


ggt gtt 


gac 


cga 


tgg 


ate 


acg 


gta 


ccg 


ccc 


gee aca 


tgg 


gtg 


gca 


gee 


cag 


ggt 


ccg 


egg 


gly val 


asp 


arg 


trp 


ile 


thr 


val 


pro 


pro 


ala thr 


trp 


val 


ala 


ala 


gin 


gly pro 


arg 


121/41 


















151/51 


















gac ace 


cag 


aat 


gtc 


ggc 


tgt 


cat 


gee 


acc 


ggc gee 


gtt 


agt 


gtg 


gee 


gat 


ctg 


ate 


gee 


asp thr 


gin 


asn 


val 


gly 


cys 


his 


ala 


thr 


gly ala 


val 


ser 


val 


ala 


asp 


leu 


ile 


ala 


181/61 


















211/71 


















agg etc 


ggc 


ccc 


get 


ttt 


cct 


gac 


etc 


ccc 


acg cac 


cgc 


cat 


gtc 


gee 


ccc 


gaa 


ccc 


gag 


arg leu 


gly pro 


ala 


phe 


pro 


asp 


leu 


pro 


thr his 


arg 


his 


val 


ala 


pro 


glu pro 


glu 


241/81 


















271/91 


















cca tec 


ggc 


cgc 


ggc 


ccg 


aag 


gtc 


cac 


gac 


gac gee 


gac 


gac 


cag 


cag 


gac 


acc 


gag 


get 


pro ser 


gly 


arg 


gly 


pro 


lys 


val 


his 


asp 


asp ala 


asp 


asp 


gin 


gin 


asp 


thr 


glu 


ala 


301/101 


















331/111 


















ate gee 


ate 


ccg 


gee 


cac 


teg 


etc 


gag 


ttc 


etc teg 


gag 


ctt 


ccc 


gac 


etc 


egg 


gca 


gee 


ile ala 


ile 


pro 


ala 


his 


ser 


leu 


glu 


phe 


leu ser 


glu 


leu 


pro 


asp 


leu 


arg 


ala 


ala 


361/121 


















391/131 


















aac tat 


ccg 


cgc 


gee 


gac 


cac 


gee 


cgc 


cgt 


gaa ccc 


gag 


eta 


ccc 


ggc 


aag 


cag 


eta 


acc 


asn tyr 


pro 


arg 


ala 


asp 


his 


ala 


arg 


arg 


glu pro 


glu 


leu 


pro 


gly 


lys 


gin 


leu 


thr 


421/141 


















451/151 


















gga teg 


get 


cga 


gtg 


egg 


cca 


ttg 


egg 


ate 


cgc cga 


acg 


teg 


ccc 


gcg 


ccc 


gee 


aag 


cca 


gly ser 


ala 


arg 


val 


arg 


pro 


leu 


arg 


ile 


arg arg 


thr 


ser 


pro 


ala 


pro 


ala 


lys 


pro 


481/161 


















511/171 


















gcg ccg 


aac 


tec 


ggc 


egg 


cgc 


ccg 


atg 


gtg 


ctg gee 


gcg 


cgc 


teg 


ctg 


gcg 


get 


ctg 


ttt 


ala pro 


asn 


ser 


gly 


arg 


arg 


pro 


met 


val" 


~leu~ ala 


ala 


arg 


- ser 


leu 


ala 


ala 


leu 


phe 


541/181 


















571/191 


















gee get 


ctg 


gcg 


ttg 


gcg 


ctg 


acc 


ggc 


ggg 


gca tgg 


cag 


tgg 


age 


gcg 


teg 


aag 


aac 


age 


ala ala 


leu 


ala 


leu 


ala 


leu 


thr 


gly gly 


ala trp 


gin 


trp 


ser 


ala 


ser 


lys 


asn 


ser 


601/201 


















631/211 


















egg ctg 


aac 


atg 


gta 


age 


gcg 


etc 


gac 


ccg 


cat teg 


ggc 


gac 


ate 


gtc 


aac 


ccc 


age 


ggg 


arg leu 


asn 


met 


val 


ser 


ala 


leu 


asp 


pro 


his ser 


gly 


asp 


ile 


val 


asn 


pro 


ser 


gly 



SEQ ID tust>. ^-570 



FIGURE 32D 



99/185 



661/221 

cag cat ggc gac gag aac ttc ttg etc gtc 
gin his gly asp glu asn phe leu leu val 
721/241 

aat ate ggc gcc ggc gac gcc gag gac gcc 
asn ile gly ala gly asp ala glu asp ala 
781/261 

gtc aac att ccg gcc age cgc gag egg gtc 
val asn ile pro ala ser arg glu arg val 
841/281 

ate act cca ate caa tgc gag gcg tgg aac 
ile thr pro ile gin cys glu ala trp asn 
901/301 

gac gag aag acg gga acg atg ggt ccc aga 
asp glu lys thr gly thr met gly pro arg 
961/321 

gca ttc tec ttc ggc ggg cct aag tgt eta 
ala phe ser phe gly gly pro lys cys leu 
1021/341 

age ate aac egg ttc ate gcg att gac ttc 
ser ile asn arg phe ile ala ile asp phe 
1081/361 

ggc ggc gtc gag gta tgc age acc acc ccg 
gly gly val glu val cys ser thr thr pro 
1141/381 

gag cac gcc gga cgc cag gtc att gac ggg 
glu his ala gly arg gin val ile asp gly 
1201/401 

cag gtc acc acc gag age aat ggc gac tac 
glh val 'thr thr glu ser asn gly asp tyr 
1261/421 

teg teg ctg ctg cgt teg atg ate teg acg 
ser ser leu leu arg ser met ile ser thr 
1321/441 

aac gtc gtc aac atg ttc ate ggt aac age 
asn val val asn met phe ile gly asn ser 
1381/461 

gtc gaa etc ggt cga teg ttg cag cat atg 
val glu leu gly arg ser leu gin his met 
1441/481 

ccg acc ggt ata acc gac cag aac ggc gac 
pro thr gly ile thr asp gin asn gly asp 
1501/501 

ctt ttc acc gcc ate ate gac gac gat ccg 
leu phe thr ala ile ile asp asp asp pro 
1561/521 

cgt ctg ggc aac acg ccg teg acc ccg ccg 
arg leu gly asn thr pro ser thr pro pro 
1621/541 

ctg acc aac gag att cag cac cag cag gtt 
leu thr asn glu ile gin his gin gin val 
1681/561 

cag gtc tct aac teg acc ggc cag gcc ggt 
gin val ser asn ser thr gly gin ala gly 
1741/581 

egg aac ggc ttc aac gtg atg get ccg. gac 
arg asn gly phe asn val met ala pro asp 
1801/601 

gtg ttt ttt teg ccc ggc aac gaa cag get 
val phe phe ser pro gly asn glu gin ala 
1861/621 

tea aag ate gag egg gtg acc ggg ate ggc 
ser lys ile glu arg val thr gly ile gly 



691/231 

ggt atg gac tct cgt gcc ggg gcg aac gcc 
gly met asp ser arg ala gly ala asn ala 
751/251 

ggc ggc gca cgt teg gac acc gtc atg ctg 
gly gly ala arg ser asp thr val met leu 
811/271 

gtc gcg gtg teg ttc ccc cgc gac ctg gcg 
val ala val ser phe pro arg asp leu ala 
871/291 

ccc gag acc ggt aag tac gga ccc ate tac 
pro glu thr gly lys tyr gly pro ile tyr 
931/311 

ctg gtg tac acg gag acc aag ctg aac teg 
leu val tyr thr glu thr lys leu asn ser 
991/331 

gtg aag gtc att cag aaa ctg teg ggc ttg 
val lys val ile gin lys leu ser gly leu 
1051/351 

gtc ggt ttc gcg egg atg gtc gag gcc etc 
val gly phe ala arg met val glu ala leu 
1111/371 

ttg egg gac tac gaa ctg ggc acg gtg ctg 
leu arg asp tyr glu leu gly thr val leu 
1171/391 , 

ccg acc gcg ctg aac tat gtg cgc get cgc 
pro thr ala leu asn tyr val arg ala arg 
1231/411 

ggg cgc ate aaa cgc cag cag ttg ttt ttg 
gly arg ile lys arg gin gin leu phe leu 
1291/431 

gac acc ttg ttc aac etc age agg etc aac 
asp thr leu phe asn leu ser arg leu asn 
1351/451 

tac gtg gac aac gtc aag acc aaa gac ctg 
tyr val asp asn val lys thr lys asp leu 
1411/471 

gcg gcc ggg cac gtc acg ttc gtg acc gtt 
ala ala gly his val thr phe val thr val 
1471/491 

gag ccc ccg cgt acc tec gac atg aag gcg 
glu pro pro arg thr ser asp met lys ala 
1531/511 

ctg ccc ctg gaa aac gat cac aac gcc cag 
leu pro leu glu asn asp his asn ala gin 
1591/531 

acc acc acc aag aag gcg ccg cag gcg ggt 
thr thr thr lys lys ala pro gin ala gly 
1651/551 

acg acg acc teg cca aaa gag gtc aca gtg 
thr thr thr ser pro lys glu val thr val 
1711/571 

ttg gcc acc acc gcc acc gat cag etc aag 
leu ala thr thr ala thr asp gin leu lys 
1771/591 

gac tac ccg agt teg ctg. ctg gcc acc aca 
asp tyr pro ser ser leu leu ala thr thr 
1831/611 

gcc gcc acc gtg gcc gcc gtg ttc ggc cag 
ala ala thr val ala ala val phe gly gin 
1891/631 

caa ctg gtc cag gtg gtg ctg ggc caa gac 
gin leu val gin val val leu gly gin asp 



SEQ IDKo^.c^?-^7i>(c ontinue<i x > 
FIGURE 32D (continued 1) 



100/185 



1921/641 

ttc age gcg gtg cgc 
phe ser ala val arg 
1981/661 

aac tec tec age cca 
asn ser ser ser pro 
2041/681 

acc acc tgc gag tag 
thr thr cys gluj AMB 



get ccc ctg ccg agt 
ala pro leu pro ser 

ccg acc aag ctg ccc 
pro thr lys leu pro 



1951/651 

ggc tec acc gtc age 
gly ser thr val ser 
2011/671 

gag gac ctg acg gtc 
glu asp leu thr val 



gtg cag ata age cgc 
val gin ile ser arg 

acc aac gec gec gac 
thr asn ala ala asp 



SEQ ID Has £>(*<1'£70( continued 2) 
FIGURE 32D (continued 2) 



ORF according to Cole et al . (Nature 393:537-544) and containing Rv0822c 

1/1 31/11 
-6Q ir> Ko. &1I tag gac atg agt gac ggc gag age gec gcg ccg tgg gca egg etc tec gag tea gca ttc 

AMB (asp met ser asp gly glu ser ala ala pro trp ala arg leu ser glu ser ala phe 
r JZ\Z>*£xS7Z 61/2 \ 91/31 

ccc gat ggt gtt gac cga tgg ate acg gta ccg ccc gec aca tgg gtg gca gee cag ggt 

pro asp gly val asp arg trp ile thr val pro pro ala thr trp val ala ala gin gly 

121/41 151/51 

ccg egg gac acc cag aat gtc ggc tgt ( cat gee acc ggc gee gtt agt gtg gee gat ctg 
pro arg asp thr gin asn val gly cys his ala thr gly ala val ser val ala asp leu' 
181/61 211/71 

ate gec agg etc ggc ccc get ttt cct gac etc ccc acg cac cgc cat gtc gec ccc gaa 
ile ala arg leu gly pro ala phe pro asp leu pro thr his arg his val ala pro glu 
241/81 271/91 

ccc gag cca tec ggc cgc ggc ccg aag gtc cac gac gac gee gac gac cag cag gac acc 
pro glu pro ser gly arg gly pro lys val his asp asp ala asp asp gin gin asp thr 
301/101 331/111 

gag get ate gee ate ccg gec cac teg etc gag ttc etc teg gag ctt ccc gac etc egg 
glu ala ile ala ile pro ala his ser leu glu phe leu ser glu leu pro asp leu arg 
361/121 391/131 

gca gee aac tat ccg cgc gec gac cac gee cgc cgt gaa ccc gag eta ccc ggc aag cag 
ala ala asn tyr pro arg ala asp his ala arg arg glu pro glu leu pro gly lys gin 
421/141 451/151 

eta acc gga teg get cga gtg egg cca ttg egg ate cgc cga acg teg ccc gcg ccc gee 
leu thr gly ser ala arg val arg pro leu arg ile arg arg thr ser pro ala pro ala 
481/161 511/171 

aag cca gcg ccg aac tec ggc egg cgc ccg atg gtg ctg gee gcg cgc teg ctg gcg get 
lys pro ala pro asn ser gly arg arg pro met val leu ala ala arg ser leu ala ala 
541/181 571/191 

ctg ttt gec get ctg gcg ttg gcg ctg acc ggc ggg gca tgg cag tgg age gcg teg aag 
leu phe ala ala leu ala leu ala leu thr gly gly ala trp gin trp ser ala ser lys 
601/201 631/211 

aac age egg ctg aac atg gta age gcg etc gac ccg cat teg ggc gac ate gtc aac ccc 
asn ser arg leu asn a&t val ser ala leu asp pro his ser gly asp ile val asn pro 
661/221 691/231 

age ggg cag cat ggc gac gag aac ttc ttg etc gtc ggt atg gac tct cgt gec ggg gcg 
ser gly gin his gly asp glu asn phe leu leu val gly met asp ser arg ala gly ala 
721/241 751/251 

aac gec aat ate ggc gec ggc gac gec gag gac gee ggc ggc gca cgt teg gac acc gtc 
asn ala asn ile gly ala gly asp ala glu asp ala gly gly ala arg ser asp thr val 
781/261 811/271 

atg ctg gtc aac att ccg gee age cgc gag egg gtc gtc gcg gtg teg ttc ccc cgc gac 
met leu val asn ile pro ala ser arg glu arg val val ala val ser phe pro arg asp 



SEQ ID R05. &1l 
FIGURE 32B 



101/185 



841/281 

ctg gcg ate act cca ate caa tgc gag gcg 
leu ala ile thr pro ile gin cys glu ala 
901/301 

ate tac gac gag aag acg gga acg atg ggt 
ile tyr asp glu lys thr gly thr met gly 
961/321 

aac teg gca ttc tec ttc ggc ggg cct aag 
asn ser ala phe ser phe gly gly pro lys 
1021/341 

ggc ttg age ate aac egg ttc ate gcg att 
gly leu ser ile asn arg phe ile ala ile 
1081/361 

gee etc ggc ggc gtc gag gta tgc age acc 
ala leu gly gly val glu val cys ser thr 
1141/381 

gtg ctg gag cac gec gga cgc cag gtc att 
val leu glu his ala gly arg gin val ile 
1201/401 

get cgc cag gtc acc acc gag age aat ggc 
ala arg gin val thr thr glu ser asn gly 
1261/421 

ttt ttg teg teg ctg ctg cgt teg atg ate 
phe leu ser ser leu leu arg ser met ile 
1321/441 

etc aac aac gtc gtc aac atg ttc ate ggt 
leu asn asn val val asn met phe ile gly 
1381/461 

gac ctg gtc gaa etc ggt cga teg ttg cag 
asp leu val glu leu gly arg ser leu gin 
1441/481 i ■ i 

acc gtt ccg acc ggt ata acc gac cag aac 
thr val pro thr gly ile thr asp gin asn 
1501/501 

aag gcg ctt ttc acc gee ate ate gac gac 
lys ala leu phe thr ala ile ile asp asp 
1561/521 

gee cag cgt ctg ggc aac acg ccg teg acc 
ala gin arg leu gly asn thr pro ser thr 
1621/541 

gcg ggt ctg acc aac gag att cag cac cag 
ala gly leu thr asn glu ile gin his gin 
1681/561 

aca gtg cag gtc tct aac teg acc ggc cag 
thr val gin val ser asn ser thr gly gin 
1741/581 

etc aag egg aac ggc ttc aac gtg atg get 
leu lys arg asn gly phe asn val met ala 
1801/601 

acc aca gtg ttt ttt teg ccc ggc aac gaa 
thr thr val phe phe ser pro gly asn glu 
1861/621 

ggc cag tea aag ate gag egg gtg acc ggg 
gly gin ser lys ile glu arg val thr gly 
1921/641 

caa gac ttc age gcg gtg cgc get ccc ctg 
gin asp phe ser ala val arg ala pro leu 
1981/661 

age cgc aac tec tec age cca ccg acc aag 
ser arg asn ser ser ser pro pro thr lys 
2041/681 

gee gac acc acc tgc gag tag 
ala asp thr thr cys glu\AMB 



871/291 

tgg aac ccc gag acc ggt aag tac gga ccc 
trp asn pro glu thr gly lys tyr gly pro 
931/311 

ccc aga ctg gtg tac acg gag acc aag ctg 
pro arg leu val tyr thr glu thr lys leu 
991/331 

tgt eta gtg aag gtc att cag aaa ctg teg 
cys leu val lys val ile gin lys leu ser 
1051/351 

gac ttc gtc ggt ttc gcg egg atg gtc gag 
asp phe val gly phe ala arg met val glu 
1111/371 

acc ccg ttg egg gac tac gaa ctg ggc acg 
thr pro leu arg asp tyr glu leu gly thr 
1171/391 

gac ggg ccg acc gcg ctg aac tat gtg cgc 
asp gly pro thr ala leu asn tyr val arg 
1231/411 

gac tac ggg cgc ate aaa cgc cag cag ttg 
asp tyr gly arg ile lys arg gin gin leu 
1291/431 

teg acg gac acc ttg ttc aac etc age agg 
ser thr asp thr leu phe asn leu ser arg 
1351/451 

aac age tac gtg gac aac gtc aag acc aaa 
asn ser tyr val asp asn val lys thr lys 
1411/471 

cat atg gcg gee ggg cac gtc acg ttc gtg 
his met ala ala gly his val thr phe val 
1471/491 

ggc gac gag ccc ccg cgt acc tec gac atg 
gly asp glu pro pro arg thr ser asp met 
1531/511 

gat ccg ctg ccc ctg gaa aac gat cac aac 
asp pro leu pro leu glu asn asp his asn 
1591/531 

ccg ccg acc acc acc aag aag gcg ccg cag 
pro pro thr thr thr lys lys ala pro gin 
1651/551 

cag gtt acg acg acc teg cca aaa gag gtc 
gin val thr thr thr ser pro lys glu val 
1711/571 

gec ggt ttg gee acc acc gec acc gat cag 
ala gly leu ala thr thr ala thr asp gin 
1771/591 

ccg gac gac tac ccg agt teg ctg ctg gee 
pro asp asp tyr pro ser ser leu leu ala 
1831/611 

cag get gec gec acc gtg gec gec gtg ttc 
gin ala ala ala thr val ala ala val phe 
1891/631 

ate ggc caa ctg gtc cag gtg gtg ctg ggc 
ile gly gin leu val gin val val leu gly 
1951/651 

ccg agt ggc tec acc gtc age gtg cag ata 
pro ser gly ser thr val ser val gin ile 
2011/671 

ctg ccc gag gac ctg acg gtc acc aac gee 
leu pro glu asp leu thr val thr asn ala 



SEQ ID Ko&.67/^ continued 1) 



FIGURE 32E (continued 1) 



h <> 102/185 

1/1 31/11 



CGT CAC CTC TGC CAT GGT CCA TCT ACG GTA TCT GCG ACA AGG GCA GCG TCG ATC CCT CGA 
arg his leu cys his gly pro ser thr val ser ala thr arg ala ala ser ile pro arg 



S> 61/21 91/31 

V CAT GCA GAG TCG GTG TTC GCT TCA CGC GAA CTA GGC GCG CCT AGC CTG GAC GAG TCC CCG 

his ala glu ser val phe ala ser arg glu leu gly ala pro ser leu asp glu ser pro 

121/41 151/51 

GGC CGA CAT TCG CCC GAG GCC TTG GCC TCC ATC ACC TAA TTG TGT GCA AAA CCG TAT CTA 

gly arg his ser pro glu ala leu ala ser ile thr) OCH(leu cys ala lys pro tyr leu 
181/61 211/71 /t S£0n>Hft575 
ATT GAT ACG ATT GCG CAC ATG GCT ATC TGG GAT C 

ile asp thr ile ala his met ala ile trp asp) 



SEQ ID flDb'* 573-575 
FIGURE 33A 

Cl/1 31/11 

GTC ACC TCT GCC ATG GTC CAT CTA CGG TAT CTG CGA CAA GGG CAG CGT CGA TCC CTC GAC 

^j*(yal thr ser ala met val his leu arg tyr leu arg gin gly gin arg arg ser leu asp 
61/21 - 91/31 

...ATG ,CAG,, AGT , r CGG, TGT TCG CTT CAC GCG AAC TAG GCG , CGC CTA GCC TGG ACG .AGT CCC CGG 

met gin ser arg cys ser leu his ala asn\AMB(ala arg leu ala trp thr ser pro arg 

i2i/4i 'lsi/Si^saa-sfcAitt 57* 

GCC GAC ATT CGC CCG AGG CCT TGG CCT CCA TCA CCT AAT TGT GTG CAA AAC CGT ATC TAA 
ala asp ile arg pro arg pro trp pro pro ser pro asn cys val gin asn arg ile^ OCH 
181/61 211/71 

.TTG ATA CGA TTG CGC ACA TGG CTA TCT GGG ATC 

(vLeu ile arg leu arg thr trp leu ser gly ile) 

^jssq xx> no: 57*} 

SEQ ID /y/OS: 57^57? 
FIGURE 33B 

1/1 31/11 
# CCG TCA CCT CTG CCA TGG TCC ATC TAC GGT ATC TGC GAC AAG GGC AGC GTC GAT CCC TCG 
^7pro ser pro leu pro trp ser ile tyr gly ile cys asp lys gly ser val asp pro ser 
^ 61/21 91/31 

ACA TGC AGA GTC GGT GTT CGC TTC ACG CGA ACT AGG CGC GCC TAG CCT GGA CGA GTC CCC 
thr cys arq val gly val arg phe thr arg thr arg arg ala)AMB (pro gly arg val pro 
121/41 151/51 ^ WeQX&w.cfc* 

GGG CCG ACA TTC GCC CGA GGC CTT GGC CTC CAT CAC CTA ATT GTG TGC AAA ACC GTA TCT 
gly pro thr phe ala arg gly leu gly leu his his leu ile val cys lys thr val ser 
181/61 211/71 
AAT TGA TAC GAT TGC GCA CAT GGC TAT CTG GGA TC 
asn^OPA^tyr asp cys ala his gly tyr leu gly) 

seq id ND5:s < 5b- s %& 



FIGURE 33C 



103/185 



sequence Rvl044 predicted by Cole et al. (Nature 393 : 537-544 ) and containing 
seq33A 

1/1 31/11 
^*> 0 ^\ttg tgt gca aaa ccg tat eta att gat acg att gcg cac atg get ate tgg gat ege etc 
y jteu cys ala lys pro tyr leu ile asp thr ile ala his met ala ile trp asp arg leu 
61/21 91/31 

gtc gag gtt gec gec gag caa cat ggc tac gtc acg act cgc gat gcg cga gac ate ggc 
^> val glu val ala ala glu gin his gly tyr val thr thr arg asp ala arg asp ile gly 

121/41 151/51 

gtc gac cct gtg cag etc cgc etc eta gcg ggg cgc gga cgt ctt gag cgt gtc ggc cga 
val asp pro val gin leu arg leu leu ala gly arg gly arg leu glu arg val gly arg 
181/61 211/71 

ggt gtg tac egg gtg ccc gtg ctg ccg cgt ggt gag cac gac gat etc gca gee gca gtg 
gly val tyr arg val pro val leu pro arg gly glu his asp asp leu ala ala ala val 
241/81 271/91 

teg tgg act ttg ggg cgt ggc gtt ate teg cat gag teg gec ttg gcg ctt cat gec etc 
ser trp thr leu gly arg gly val ile ser his glu ser ala leu ala leu his ala leu 
301/101 331/111 

get gac gtg aac ccg teg cgc ate cat etc acc gtc ccg cgc aac aac cat ccg cgt gcg 
ala asp val asn pro ser arg ile his leu thr val pro arg asn asn his pro arg ala 
361/121 391/131 

gee ggg ggc gag ctg tac cga gtt cac cgc cgc gac etc cag gca gee cac gtc act teg 
ala gly gly glu leu tyr arg val his arg arg asp leu gin ala ala his val thr ser 
, ... 421/141 ■ - ■ 451/151 ♦ .. . ... .. • . 

gtc gac gga ata ccc gtc acg acg gtt gcg cgc acc ate aaa gac tgc gtg aag acg ggc 
val asp gly ile pro val thr thr val ala arg thr ile lys asp cys val lys thr gly 
481/161 511/171 

acg gat cct tat cag ctt . egg gee gcg ate gag cga gee gaa gec gag ggc acg ctt cgt 
thr asp pro tyr gin leu arg ala ala ile glu arg ala glu ala glu gly thr leu arg 
541/181 571/191 

cgt ggg tea gca get gag eta cgc get gcg etc gat gag acc act gec gga tta cgc get 
arg gly ser ala ala glu leu arg ala ala leu asp glu thr thr ala gly leu arg ala 
601/201 

egg ccg aag cga gca teg gcg tga 
arg pro lys arg ala ser ala^OPA 



SEQ ID A/DS'. S<SM, 5%5 
FIGURE 33D 



104/185 



^ORF according to Cole et al . (Nature 393:537-544) and containing Rvl044 

<& 1/1 31/11 

v^taa ttg tgt gca aaa ccg tat eta att gat acg att gcg cac atg get ate tgg gat cgc 
OCH (leu cys ala lys pro tyr leu ile asp thr ile ala his met ala ile trp asp arg 
6\/21*-SGVrt%>tiU.*>%l 91/31 

etc gt- gag gtt gcc gcc gag caa cat ggc. tac gtc acg act cgc gat gcg cga gac ate 
leu val glu val ala ala glu gin his gly tyr val thr thr arg asp ala arg asp ile 
121/41 151/51 

ggc gtc gac cct gtg cag etc cgc etc eta gcg ggg cgc gga cgt ctt gag cgt gtc ggc 
• gly val asp pro val gin leu arg leu leu ala gly arg gly arg leu glu arg val gly 
181/61 211/71 

cga ggt gtg tac egg gtg ccc gtg ctg ccg cgt ggt gag cac gac gat etc gca gcc gca 
arg gly val tyr arg val pro val leu pro arg gly glu his asp asp leu ala ala ala 
241/81 271/91 

gtg teg tgg act ttg ggg cgt ggc gtt ate teg cat gag teg gcc ttg gcg ctt cat gcc 
val ser trp thr leu gly arg gly val ile ser his glu ser ala leu ala leu his ala 
301/101 331/111 

etc get gac gtg aac ccg teg cgc ate cat etc ace gtc ccg cgc aac aac cat ccg cgt 
leu ala asp val asn pro ser arg ile his leu thr val pro arg asn asn his pro arg 
361/121 391/131 

gcg gcc ggg ggc gag ctg tac cga gtt cac cgc cgc gac etc cag gca gcc cac gtc act 
ala ala gly gly glu leu tyr arg val his arg arg asp leu gin ala ala his val thr 
421/141 451/151 

teg gtc gac gga ata ccc gtc acg acg gtt gcg cgc ace ate aaa gac tgc gtg aag acg 
ser val asp gly ile pro val thr thr val ala arg thr ile lys asp cys val lys thr 
481/161' " ' ' " 511/171 

ggc acg gat cct tat cag ctt egg gcc gcg ate gag cga gcc gaa gcc gag ggc acg ctt 
gly thr asp pro tyr gin leu arg ala ala ile glu arg ala glu ala glu gly thr leu 
541/181 571/191 

cgt cgt ggg tea gca get gag eta cgc get gcg etc gat gag ace act gcc gga tta cgc 
arg arg gly ser ala ala glu leu arg ala ala leu asp glu thr thr ala gly leu arg 
601/201 

get egg ccg aag cga gca teg gcg tga 
ala arg pro lys arg ala ser alayOPA 

SEQ ID 

FIGURE 33B 

\ 1/1 31/11 

^-^ATC CAA CCT GCT GGG CCT GCG CCT TCG AAT CGA CGG CCA GGC CAC CGC TCG CTG CCG GCA 
.Vile gin pro ala gly pro ala pro ser asn arg arg pro gly his arg ser leu pro ala 
%1/21 91/31 
*' ACA ACA CCT GGA ATG GGG ACC TTT TCG GTG TTG CTG GTA ACC GGG ACA ACC GGC ACC ACG 

thr thr pro gly met gly thr phe ser val leu leu val thr gly thr thr gly thr thr 
121/41 151/51 

CCT CGG TCG AGA CGT ATC GCG GCA GCG TTG GCC CTG TCG TTG CTG ACA ATT ACC GCT GGC 
pro arg ser arg arg ile ala ala ala leu ala leu ser leu leu thr ile thr ala gly 
181/61 211/71 
CGC CGC ATA TTT GCC GCG CTG CCG CGG GCC GGA TC 
arg arg ile phe ala ala leu pro arg ala gly) 



SEQ ID uds;5%%, 5^ 



FIGURE 34A 



105/185 



<^1/1 31/11 



V^TCC AAC CTG CTG GGC CTG CGC CTT CGA ATC GAC GGC CAG GCC ACC GCT CGC TGC CGG CAA 

/ser asn leu leu gly leu arg leu arg ile asp gly gin ala thr ala arg cys arg gin 
«5.#^61/21 91/31 

^ CAA CAC CTG GAA TGG GGA CCT TTT CGG TGT TGC TGG TAA CCG GGA CAA CCG GCA CCA CGC 

gin his leu glu trp gly pro phe arg cys cys trp^OCH (pro gly gin pro ala pro arg 
^ 12.1/41 151/51 J C&Z&32> ^C>:S9^ 

. CTC GGT CGA GAC GTA TCG CGG CAG CGT TGG CCC TGT CGT TGC TGA CAA TTA CCG CTG GCC 

leu gly arg asp val ser arg gin arg trp pro cys arg cys^OPA (cjln leu pro leu ala 
181/61 211/71 ^seQxbNOl S?^ 

GCC GCA TAT TTG CCG CGC TGC CGC GGG CCG GAT C 

ala ala tyr leu pro arg cys arg gly pro asp) 



SEQ ID A(DS'. 5?£>- 5*7-3 
FIGURE 34B 



^ Q i/i 31/11 

\*GAT CCA ACC TGC TGG GCC TGC GCC TTC GAA TCG ACG GCC AGG CCA CCG CTC GCT GCC GGC 

S>(asp pro thr cys trp ala cys ala phe glu ser thr ala arg pro pro leu ala ala gly 

^61/21 91/31 

AAC AAC ACC TGG AAT GGG GAC CTT TTC GGT GTT GCT GGT AAC CGG GAC AAC CGG CAC CAC 

<b<jJ* asn asn thr trp asn gly asp leu phe gly val ala gly asn arg asp asn arg his his 

121/41 151/51 

GCC TCG GTC GAG ACG TAT CGC GGC AGC GTT GGC CCT GTC GTT GCT GAC AAT TAC CGC TGG 

ala ser val glu thr tyr arg gly ser val gly pro val val ala asp asn tyr arg trp 

181/61 211/71 

CCG CCG CAT ATT TGC CGC GCT GCC GCG GGC CGG ATC 

pro pro his ile cys arg ala ala ala gly arg ile^ 



SEQ ID M0S\S<f^ t 5f£ 
FIGURE 34C 



* 106/185 

y 

ORF according to Cole et al . (Nature 393:537-544) containing seq34A 



1/1 

^tag ccg cag ggc cct gcg 
AMB^pro gin gly pro ala 
6l/2i^$e<& MO*. 5<?7 
ttg cag cag tta caa cgc 
leu gin gin leu gin arg 
121/41 

agg gga gac gtt gcg cag 
arg gly asp val ala gin 
181/61 

cca ggc cac cgc teg ctg 
pro gly his arg ser leu 
241/81 

gta acc ggg aca acc ggc 
val thr gly thr thr gly 
301/101 

teg ttg ctg aca att acc 
ser leu leu thr ile thr 
361/121 

agg teg acc tgc cag ate 
arg ser thr cys gin ile 
421/141 

gcg acg gca ggc cca etc 
ala thr ala gly pro leu 
481/161 

etc acg ctt ggc ttc atg 
leu thr leu gly phe met 



31/11 

get agg cgc ggc egg tgc cgt 
ala arg arg gly arg cys arg 
91/31 

caa atg gag tct gag cgc ate 
gin met glu ser glu arg ile 
151/51 

cga egg ate caa cct get ggg 
arg arg ile gin pro ala gly 
211/71 

ccg gca aca aca cct gga atg 
pro ala thr thr pro gly met 
271/91 

acc acg cct egg teg aga cgt 
thr thr pro arg ser arg arg 
331/111 

get ggc cgc cgc ata ttt gee 
ala gly arg arg ile phe ala 
391/131 

tea ccg cgc age ate tac gee 
ser pro arg ser ile tyr ala 
451/151 

tct tgg cat gcg tec aat get 
ser trp his ala ser asn ala 
511/171 

ccg cag tec tac cca tgt agt 
pro gin ser tyr pro cys ser 



tgg ccg egg egg caa teg atg 
trp pro arg arg gin ser met 

gtc gag ttc gat cag etc ggc 
val glu phe asp gin leu gly 

cct gcg cct teg aat cga egg 
pro ala pro ser asn arg arg 

ggg acc ttt teg gtg ttg ctg 
gly thr phe ser val leu leu 

ate gcg gca gcg ttg gee ctg 
ile ala ala ala leu ala leu 

gcg ctg ccg egg gee gga tec 
ala leu pro arg ala gly ser 

gtt cgc tgc aaa ccg ccg act 
val arg cys lys pro pro thr 

gcg acg tec teg gta gac aag 
ala thr ser ser val asp lys 

aac aga tag 
asn arg^AMB 



SEQ ID NDS-.59&,S97 
FIGURE 34E 



31/11 

^CAG TCT GTC GGC AAG GAG GGA CGC ATG CCA CTC TCC GAT CAT GAG CAG CGG ATG CTT GAC 
-gin ser val gly lys glu gly arg met pro leu ser asp his glu gin arg met leu asp 
* 61/21 91/31 

CAG ATC GAG AGC GCT CTC TAC GCC GAA GAT CCC AAG TTC GCA TCG AGT GTC CGT GGC GGG 
gin ile glu ser ala leu tyr ala glu asp pro lys phe ala ser ser val arg gly gly 
121/41 151/51 

GGC TTC CGC GCA CCG ACC GCG CGG CGG CGC CTG CAG GGC GCG GCG TTG TTC ATC ATC GGT 
gly phe arg ala pro thr ala arg arg arg leu gin gly ala ala leu phe ile ile gly 
181/61 211/71 

CTG GGG ATG TTG GTT TCC GGC GTG GCG TTC AAA GAG ACC ATG ATC GGA AGT TTC CCG ATA 
leu gly met leu val ser gly val ala phe lys glu thr met ile gly ser phe pro ile 
241/81 271/91 

CTC AGC GTT TTC GGT TTT GTC GTG ATG TTC GGT GGT GTG GTG TAT GCC ATC ACC GGT CCT 
leu ser val phe gly phe val val met phe gly gly val val tyr ala ile thr gly pro 
301/101 331/111 

CGG TTG TCC GGC AGG ATG GAT CGT GGC GGA TCG GCT GCT GGG GCT TCG CGC CAG CGT CGT 
arg leu ser gly arg met asp arg gly gly ser ala ala gly ala ser arg gin arg arg 
361/121 391/131 
ACC AAG GGG GCC GGG GGC TCA TTC ACC AGC CGT ATG GAA GAT C 
thr lys gly ala gly gly ser phe thr ser arg met glu aspy 



SEQ ID . 59^.5^ 
FIGURE 35A 



107/185 



.(.CP 

VP" 



1/1 31/11 
v^GAC AGT CTG TCG GCA AGG AGG GAC GCA TGC CAC TCT CCG ATC ATG AGC AGC GGA TGC TTG 
/asp ser leu ser ala arg arg asp ala cys his ser pro ile met ser ser gly cys leu 
V ^S61/21 91/31 
aI^ ACC AGA TCG AGA GCG CTC TCT ACG CCG AAG ATC CCA AGT TCG CAT CGA GTG TCC GTG GCG 
thr arg ser arg ala leu ser thr pro lys ile pro ser ser his arg val ser val ala 
121/41 151/51 

GGG GCT TCC GCG CAC CGA CCG CGC GGC GGC GCC TGC AGG GCG CGG CGT TGT TCA TCA TCG 
. gly ala ser ala his arg pro arg gly gly ala cys arg ala arg arg cys ser ser ser 
181/61 211/71 

GTC TGG GGA TGT TGG TTT CCG GCG TGG CGT TCA AAG AGA CCA TGA TCG GAA GTT TCC CGA 
val trp qly cys trp phe pro ala trp arg ser lys arg pro)oPA(ser glu val ser arg 
241/81 271/91 ' ^tGQKbHO: 

TAC TCA GCG TTT TCG GTT TTG TCG TGA TGT TCG GTG GTG TGG TGT ATG CCA TCA CCG GTC 
tyr ser ala phe ser val leu ser) OPA(cys ser val val trp cys met pro ser pro val 
301/101 Se5?I>NP:6p03 331/111 

CTC GGT TGT CCG GCA GGA TGG ATC GTG GCG GAT CGG CTG CTG GGG CTT CGC GCC AGC GTC 

leu gly cys pro ala gly trp ile val ala asp arg leu leu gly leu arg ala ser val 

361/121 391/131 

GTA CCA AGG GGG CCG GGG GCT CAT TCA CCA GCC GTA TGG AAG ATC 

val pro arg gly pro gly ala his ser pro ala val trp lys ile) 



' SEQ ID HOh\ (sDO-faD'S, 
FIGURE 35B 



V 



1/1 


















31/11 


















'ACA GTC 


TGT 


CGG 


CAA 


GGA 


GGG 


ACG 


CAT 


GCC 


ACT CTC 


CGA 


TCA 


TGA 


GCA 


GCG 


GAT 


GCT 


TGA 


[thr val 
61/21 


cys 


arg 


gin 


gly 


gly 


thr 


his 


ala 


thr leu 
91/31 


arg 


ser) 


OPA< 


'ala ala asp ala) 


iOPA 


CCA GAT 


CGA 


GAG 


CGC 


TCT 


CTA 


CGC 


CGA 


AGA 


TCC CAA 


GTT 


CGC 


ATC 


GAG 


TGT 


CCG 


TGG 


CGG 


pro asp 


arg 


glu 


arg 


ser 


leu 


arg 


arg 


arg 


ser gin 


val 


arg 


ile 


glu 


cys 


pro 


trp 


arg 


'121/41 


















151/51 


















GGG CTT 


CCG 


CGC 


ACC 


GAC 


CGC 


GCG 


GCG 


GCG 


CCT GCA 


GGG 


CGC 


GGC 


GTT 


GTT 


CAT 


CAT 


CGG 


gly leu 


pro 


arg 


thr 


asp 


arg 


ala 


ala 


ala 


pro ala 


gly 


arg 


gly 


val 


val 


his 


his 


arg 


181/61 














211/71 


















TCT GGG 


GAT 


GTT 


GGT 


TTC 


CGG 


CGT 


GGC 


GTT 


CAA AGA 


GAC 


CAT 


GAT 


CGG 


AAG 


TTT 


CCC 


GAT 


ser gly 


asp 


val 


gly 


phe 


arg 


arg 


gly 


val 


gin arg 


asp 


his 


asp 


arg 


lys 


phe 


pro 


asp 


241/81 


















271/91 


















ACT CAG 


CGT 


TTT 


CGG 


TTT 


TGT 


CGT 


GAT 


GTT 


CGG TGG 


TGT 


GGT 


GTA 


TGC 


CAT 


CAC 


CGG 


TCC 


thr gin 


arg 


phe 


arg 


phe 


cys 


arg 


asp 


val 


arg trp 


cys 


gly 


val 


cys 


his 


his 


arg 


ser 


301/101 














331/111 
















TCG 


TCG GTT 


GTC 


CGG 


CAG 


GAT 


GGA 


TCG 


TGG 


CGG 


ATC GGC 


TGC 


TGG 


GGC 


TTC 


GCG 


CCA 


GCG 


ser val 


val 


arg 


gin 


asp 


gly 


ser 


trp 


arg 


ile gly 


cys 


trp 


gly 


phe 


ala 


pro 


ala 


ser 


361/121 














391/131 


















TAC CAA 


GGG 


GGC 


CGG 


GGG 


CTC 


ATT 


CAC 


CAG 


CCG TAT 


GGA 


AGA 


TC 












tyr gin 


gly 


gly 


arg 


gly 


leu 


ile 


his 


gin 


pro tyr 


gly 


arg 


) 













seq id 6 0*4 -0>oi 

FIGURE 35C 



bo 
O 

^ 108/185 
o 

^ sequence Rv2169c predicted by Cole et al. (Nature 393:537-544) and partially 
q, containing seq35A 

7 1/1 31/11 

Vyatg cca etc tec gat cat gag cag egg atg ctt gac cag ate gag age get etc tac gee 
*^Met pro leu ser asp his glu gin arg met leu asp gin ile glu ser ala leu tyr ala 
\& 61/21 91/31 

gaa gat ccc aag ttc gca teg agt gtc cgt ggc ggg ggc ttc cgc gca ccg ace gcg egg 
glu asp pro lys phe ala ser ser val arg gly gly gly phe arg ala pro thr ala arg 
121/41 151/51 

egg cgc ctg cag ggc gcg gcg ttg ttc ate ate ggt ctg ggg atg ttg gtt tec ggc gtg 
arg arg leu gin gly ala ala leu phe ile ile gly leu gly met leu val ser gly val 
181/61 211/71 

gcg ttc aaa gag ace atg ate gga agt ttc ccg ata etc age gtt ttc ggt ttt gtc gtg 
ala phe lys glu thr met ile gly ser phe pro ile leu ser val phe gly phe val val 
241/81 271/91 

atg ttc ggt ggt gtg gtg tat gee ate acc ggt cct egg ttg tec ggc agg atg gat cgt 
met phe gly gly val val tyr ala ile thr gly pro arg leu ser gly arg met asp arg 
301/101 331/111 

ggc gga teg get get ggg get teg cgc cag cgt cgt acc aag ggg gee ggg ggc tea ttc 

gly gly ser ala ala gly ala ser arg gin arg arg thr lys gly ala gly gly ser phe 

361/121 391/131 

acc age cgt atg gaa gat egg ttc egg cgc cgc ttc gac gag taa 

thr ser arg met glu asp arg phe arg arg arg phe asp glu) OCH 



SEQ ID tf0 5: fco2-&Oi 
FIGURE 35D 



tyORF according to Cole et al . (Nature 393:537-544) and containing Rv2169c 

^ 1/1 31/11 

^|tga cag tct gtc ggc aag gag gga cgc atg cca etc tec gat cat gag cag egg atg ctt 
OPA(gln ser val gly lys glu gly arg met pro leu ser asp his glu gin arg met leu 
61/21^5^x5 NO\ (p\\ 91/31 

gac cag ate gag age get etc tac gee gaa gat ccc aag ttc gca teg agt gtc cgt ggc 
asp gin ile glu ser ala leu tyr ala glu asp pro lys phe ala ser ser val arg gly 
121/41 151/51 

ggg ggc ttc cgc gca ccg acc gcg egg egg cgc ctg cag ggc gcg gcg ttg ttc ate ate 
gly gly phe arg ala pro thr ala arg arg arg leu gin gly ala ala leu phe ile ile 
181/61 211/71 

ggt ctg ggg atg ttg gtt tec ggc gtg gcg ttc aaa gag acc atg ate gga agt ttc ccg 
gly leu gly met leu val ser gly val ala phe lys glu thr met ile gly ser phe pro 
241/81 271/91 

ata etc age gtt ttc ggt ttt gtc gtg atg ttc ggt ggt gtg gtg tat gee ate acc ggt 
ile leu ser val phe gly phe val val met phe gly gly val val tyr ala ile thr gly 
301/JL01 

cct egg ttg tec ggc agg atg gat cgt ggc gga teg get get ggg get teg cgc cag cgt 
pro arg leu ser gly arg met asp arg gly gly ser ala ala gly ala ser arg gin arg 
361/121 391/131 

cgt acc aag ggg gec ggg ggc tea ttc acc age cgt atg gaa gat egg ttc egg cgc cgc 
arg thr lys gly ala gly gly ser phe thr ser arg met glu asp arg phe arg arg arg 
421/141 



ttc gac gag taa 
phe asp glu^OCH 



SEQ ID fsl03\(p(d t (s>([ 



FIGURE 35E 



109/185 

$ 1/1 31/11 



Sp»GAC CTG 


GGA 


CGA 


AGA 


CGA 


CGG 


CAG 


CAG 


CCG 


CAA TCA 


GAT 


CTA 


CCC 


GGT 


CCT 


GGT 


CAA 


CGT 


iasp leu 
Al/21 


gly 


arg 


arg 


arg 


arg 


gin 


gin 


pro 


gin ser 


asp 


leu 


pro 


gly 


pro 


gly 


gin 


arg 


















91/31 


















CAA TGG 


ACA 


CCC 


GAC 


TAC 


GGT 


GCG 


CCT 


GCG 


CGG CTC 


GAC 


AAT 


GCG 


CGG 


TTC 


CTG 


TTG 


CCC 


gin trp 


thr 


pro 


asp 


tyr 


gly 


ala 


pro 


ala 


arg leu 


asp 


asn 


ala 


arg 


phe 


leu 


leu 


pro 


121/41 


















151/51 


















GTG GTC 


GGA 


GTG 


CCA 


CCC 


GAC 


CAG 


GCC 


ACC 


GAC TTC 


GGC 


TCC 


GCT 


GTT 


GCA 


CCA 


GAA 


ACG 


■ val val 


gly 


val 


pro 


pro 


asp 


gin 


ala 


thr 


asp phe 


gly 


ser 


ala 


val 


ala 


pro 


glu 


thr 


181/61 


















211/71 
















ACG GCG 


CCG 


GTC 


TGG 


ATC 


ACC 


ATG 


CTG 


TGG 


CCG CTG 


GCC 


GAC 


CGG 


CCC 


CGG 


TTG 


GCC 


CCC 


thr ala 


pro 


val 


trp 


ile 


thr 


met 


leu 


trp 


pro leu 


ala 


asp 


arg 


pro 


arg 


leu 


ala . 


pro 


241/81 


















271/91 


















GGG GCA 


CCC 


GGT 


GGC 


ACC 


GTT 


CCC 


GTC 


CGG 


CTG GTC 


GAC 


GAC 


GAC 


CTG 


GCA 


AAC 


TCG 


CTG 


gly ala 


pro 


gly 


gly 


thr 


val 


pro 


val 


arg 


leu val 


asp 


asp 


asp 


leu 


ala 


asn 


ser 


leu 


301/101 


















331/111 


















GCC AAC 


GGC 


GGC 


CGG 


CTG 


GAC 


ATC 


CTC 


CTG 


TCG GCG 


GCC 


GAG 


TTC 


GCC 


ACC 


AAC 


CGG 


GAA 


ala asn 


gly 


gly 


arg 


leu 


asp 


ile 


leu 


leu 


ser ala 


ala 


glu 


phe 


ala 


thr 


asn 


arg 


glu 


361/121 


















391/131 


















GTC GAC 


CCC 


GAC 


GGC 


GCC 


GTC 


GGC 


CGA 


GCG 


CTG TGC 


CTG 


GCC 


ATC 


GAC 


CCA 


GAT 


C 




val asp 


pro 


asp 


gly 


ala 


val 


gly 


arg 


ala 


leu cys 


leu 


ala 


ile 


asp 


pro 


asp) 





SEQ ID AIDS'. (s>(3L-(h(3^ 
FIGURE 36A 

( 1/1 31/11 

7 ACC TGG GAC GAA GAC GAC GGC AGC AGC CGC AAT CAG ATC TAC CCG GTC CTG GTC AAC GTC 
UvS^thr trp asp glu asp asp gly ser ser arg asn gin ile tyr pro val leu val asn val 
61/21 91/31 

AAT GGA CAC CCG ACT ACG GTG CGC CTG CGC GGC TCG ACA ATG CGC GGT TCC TGT TGC CCG 
asn gly his pro thr thr val arg leu arg gly ser thr met arg gly ser cys cys pro 
121/41 151/51 

TGG TCG GAG TGC CAC CCG ACC AGG CCA CCG ACT TCG GCT CCG CTG TTG CAC CAG AAA CGA 
trp ser glu cys his pro thr arg pro pro thr ser ala pro leu leu his gin lys arg 
181/61 211/71 

CGG CGC CGG TCT GGA TCA CCA TGC TGT GGC CGC TGG CCG ACC GGC CCC GGT TGG CCC CCG 
arg arg arg ser gly ser pro cys cys gly arg trp pro thr gly pro gly trp pro pro 
241/81 271/91 

GGG CAC CCG GTG GCA CCG TTC CCG TCC GGC TGG TCG ACG ACG ACC TGG CAA ACT CGC TGG 
gly his pro val ala pro phe pro ser gly trp ser thr thr thr trp gin thr arg trp 
301/101 _ _ 331/111_ _ 

CCA ACG GCG GCC ' GGC TGG ACA TCC TCC TGT CGG'CGG CCG AGT TCG CCA CCA ACC GGG AAG 
pro thr ala ala gly trp thr ser ser cys arg arg pro ser ser pro pro thr gly lys 
361/121 391/131 

TCG ACC CCG ACG GCG CCG TCG GCC GAG CGC TGT GCC TGG CCA TCG ACC CAG ATC 
ser thr pro thr ala pro ser ala glu arg cys ala trp pro ser thr gin ile) 



SEQ ID ^05'. 
FIGURE 36B 



110/185 



31/11 

L/CCT GGG ACG AAG ACG ACG GCA GCA GCC GCA ATC AGA TCT ACC CGG TCC TGG TCA ACG TCA 
.Wpro gly thr lys thr thr ala ala ala ala ile arg ser thr arg ser trp ser thr ser 
<*"*61/21 91/31 

ATG GAG ACC CGA CTA CGG TGC GCC TGC GCG GCT CGA CAA TGC GCG GTT CCT GTT GCC CGT 
met asp thr arg leu arg cys ala cys ala ala arg gin cys ala val pro val ala arg 
121/41 151/51 

GGT CGG AGT GCC ACC CGA CCA GGC CAC CGA CTT CGG CTC CGC TGT TGC ACC AGA AAC GAC 
gly arg ser ala thr arg pro gly his arg leu arg leu arg cys cys thr arg asn asp 
181/61 211/71 

GGC GCC GGT CTG GAT CAC CAT GCT GTG GCC GCT GGC CGA CCG GCC CCG GTT GGC CCC CGG 
gly ala gly leu asp his his ala val ala ala gly arg pro ala pro val gly pro arg 
241/81 271/91 

GGC ACC CGG TGG CAC CGT TCC CGT CCG GCT GGT CGA CGA CGA CCT GGC AAA CTC GCT GGC 
gly thr arg trp his arg ser arg pro ala gly arg arg arg pro gly lys leu ala gly 
301/101 331/111 

CAA CGG CGG CCG GCT GGA CAT CCT CCT GTC GGC GGC CGA GTT CGC CAC CAA CCG GGA AGT 
gin arg arg pro ala gly his pro pro val gly gly arg val arg his gin pro gly ser 
361/121 391/131 

CGA CCC CGA CGG CGC CGT CGG CCG AGC GCT GTG CCT GGC CAT CGA CCC AGA TC 
arg pro arg arg arg arg arg pro ser ala val pro gly his arg pro argy 

SEQ ID 7 
FIGURE 36C 

^Coding sequence Rv3909 predicted by Cole et al., 1998 (Nature 393 537-544) 
^containing Seq 36A 





















31/11 


















yGTG ACC 


GCA 


CTG 


CAA 


CTC 


GGC 


TGG 


GCC 


GCT 


TTG GCG 


CGC 


GTC 


ACC 


TCA 


GCG 


ATC 


GGC 


GTC 


(met thr 


ala 


leu 


gin 


leu 


gly 


trp 


ala 


ala 


leu ala 


arg 


val 


thr 


ser 


ala 


ile 


gly 


val 


61/21 


















91/31 
















GGC 


GTG GCC 


GGC 


CTC 


GGG 


ATG 


GCG 


CTC 


ACG 


GTA 


CCG TCG 


GCG 


GCA 


CCG 


CAC 


GCG 


CTC 


GCA 


val ala 


gly 


leu 


gly 


met 


ala 


leu 


thr 


val 


pro ser 


ala 


ala 


pro 


his 


ala 


leu 


ala 


gly 


121/41 














151/51 


















GAG CCC 


AGC 


CCG 


ACG 


CCT 


TTT 


GTC 


CAG 


GTC 


CGC ATC 


GAT 


CAG 


GTG 


ACC 


CCG 


GAC 


GTG 


GTG 


glu pro 


ser 


pro 


thr 


pro 


phe 


val 


gin 


val 


arg ile 


asp 


gin 


val 


thr 


pro 


asp 


val 


val 


181/61 


















211/71 
















CGC 


ACC ACT 


TCC 


AGC 


GAA 


CCC 


CAT 


GTC 


ACC 


GTC 


AGC GGA 


ACG 


GTG 


ACC 


AAT 


ACC 


GGT 


GAC 


thr thr 


ser 


ser 


glu 


pro 


his 


val 


thr 


val 


ser gly 


thr 


val 


thr 


asn 


thr 


gly 


asp 


arg 


2.41/81 
















271/91 
















GCG 


CCA GTC 


CGC 


GAT 


GTG 


ATG 


GTC 


CGG 


CTT 


GAG 


CAC GCC 


GCC 


GCG 


GTC 


ACG 


TCG 


TCA 


ACG 


pro val 


arg 


asp 


val 


met 


val 


arg 


leu 


glu 


his ala 


ala 


ala 


val 


thr 


ser 


ser 


thr 


ala 


301/101 
















331/111 














CTC 


ACG 


TTA CGC 


ACC 


TCG 


CTC 


GAC 


GGC 


GGC 


ACC 


GAC 


CAG TAC 


CAG 


CCG 


GCC 


GCG 


GAC 


TTC 


leu arg 


thr 


ser 


leu 


asp 


gly 


gly 


thr 


asp 


gin tyr 


gin 


pro 


ala 


ala 


asp 


phe 


leu 


thr 



seq id mc5: fafS-fcil 



FIGURE 36D 



111/185 



361/121 

GTC GCC CCC GAA CTA GAC CGC GGG CAA GAG 
val ala pro glu leu asp arg gly gin glu 
421/141 

TCG CTG ACC AGG CCG TCG TTG GCC GTC AAC 
ser leu thr arg pro ser leu ala val asn 
481/161 

GTC AAT GGG ACA CCC GAC TAC GGT GCG CCT 
val asn gly thr pro asp tyr gly ala pro 
541/181 

CCC GTG GTC GGA GTG CCA CCC GAC CAG GCC 
pro val val gly val pro pro asp gin ala 
601/201 

ACG ACG GCG CCG GTC TGG ATC ACC ATG CTG 
thr thr ala pro val trp ile thr met leu 
661/221 

CCC GGG GCA CCC GGT GGC ACC GTT CCC GTC 
pro gly ala pro gly gly thr val pro val 
721/241 

CTG GCC AAC GGC GGC CGG CTG GAC ATC CTC 
leu ala asn gly gly arg leu asp ile leu 
781/261 

GAA GTC GAC CCC GAC GGC GCC GTC GGC CGA 
glu val asp pro asp gly ala val gly arg 
,841/281 t h 

CTC ATC ACC GTC AAT GCG ATG ACC GGC GGC 
leu ile thr val asn ala met thr gly gly 
901/301 

GCT CAA CTA CCG GGC ACC CCG ACC CAC CCG 
ala gin leu pro gly thr pro thr his pro 
961/321 

CTG GAT CGA TTG CGG ACG CTA GTC CAC CGG 
leu asp arg leu arg thr leu val his arg 
1021/341 

GCC GAC CTG GAT GCT TTG CAG CGG GTT AAT 
ala asp leu asp ala leu gin arg val asn 
1081/361 

AGC CCC GCC GAC ATC GTC GAC CGC ATC CTG 
ser pro ala asp ile val asp arg ile leu 
1141/381 

CTG CCC GAC GGC CCG TTG ACC GGC CGG GCG 
leu pro asp gly pro leu thr gly arg ala 
1201/401 

GTT GCC GTC GCG GCC GCC GAT TTT AGC CCC 
val ala val ala ala ala asp phe ser pro 
1261/421 

TCC GCG CTC TTA CCC GCT ACC GCG CCC CGG 
ser ala leu leu pro ala thr ala pro arg 
1321/441 - - 

TTT GAT CCC GCG GTC GGG GCC GCG CTG GCC 
phe asp pro ala val gly ala ala leu ala 
1381/461 

TAT CTA GAT CCC TCG TTG TTC GTT CGG ATC 
tyr leu asp pro ser leu phe val arg ile 



seq id a/£S; 

FIGURE 36D 



391/131 

GCC GGC TTT ACC CTC TCG GCC CCG CTG CGC 
ala gly phe thr leu ser ala pro leu arg 
451/151 

CAG CCC GGG ATC TAC CCG GTC CTG GTC AAC 
gin pro gly ile tyr pro val leu val asn 
511/171 

GCG CGG CTC GAC AAT GCG CGG TTC CTG TTG 
ala arg leu asp asn ala arg phe leu leu 
571/191 

ACC GAC TTC GGC TCC GCT GTT GCA CCA GAA 
thr asp phe gly ser ala val ala pro glu 
631/211 

TGG CCG CTG GCC GAC CGG CCC CGG TTG GCC 
trp pro leu ala asp arg pro arg leu ala 
691/231 

CGG CTG GTC GAC GAC GAC CTG GCA AAC TCG 
arg leu val asp asp asp leu ala asn ser 
751/251 

CTG TCG GCG GCC GAG TTC GCC ACC AAC CGG 
leu ser ala ala glu phe ala thr asn arg 
811/271 

GCG CTG TGC CTG GCC ATC GAC CCA GAT CTA 
ala leu cys leu ala ile asp pro asp leu 
871/291 

TAC GTC GTG TCC GAC TCG CCC GAC ' GGG GCC 
tyr val val ser asp ser pro asp gly ala 
931/311 

GGC ACC GGC CAG GCC GCC GCA TCC AGC TGG 
gly thr gly gin ala ala ala ser ser trp 
991/331 

ACA TGC GTG ACG CCG CTG CCT TTT GCC CAA 
thr cys val thr pro leu pro phe ala gin 
1051/351 

GAT CCG AGG CTG AGC GCG ATC GCA ACC ATC 
asp pro arg leu ser ala ile ala thr ile 
1111/371 

GAT GTC AGC TCC ACC CGC GGC GCA ACC GTG 
asp val ser ser thr arg gly ala thr val 
1171/391 

ATC AAC TTG CTC AGC ACC CAC GGC AAC ACG 
ile asn leu leu ser thr his gly asn thr 
1231/411 

GAG GAA CAG CAG GGT TCG TCC CAG ATC GGC 
glu glu gin gin gly ser ser gin ile gly 
1291/431 

CGG TTG TCC CCG CGG GTG GTA GCG GCG CCG 
arg leu ser pro arg val val ala ala pro 

1351/451 

GCC GCG GGA ACA AAC CCG ACC GTT CCT ACC 
ala ala gly thr asn pro thr val pro thr 
1411/471 

GCG CAT GAA TCG ATC ACC GCG CGC CGC CAG 
ala his glu ser ile thr ala arg arg gin 



(continued 1) 



112/185 



1441/481 

GAC GCC TTG GGC GCA ATG CTG TGG CGC AGC 
asp ala leu gly ala met leu trp arg ser 
1501/501 

ATC CTG GTG CCG CCG GCG TCG TGG AGC CTG 
ile leu val pro pro ala ser trp ser leu 
1561/521 

GCG CTG GCC ACC GCC ATC CGG TCT GGC CTG 
ala leu ala thr ala ile arg ser gly leu 
1621/541 

GCT GAC GCC GCG GCC CGC ACC GAG CCA CCG 
ala asp ala ala ala arg thr glu pro pro 
1681/561 

GGC CGG TTC AAT GAC GAC ATC ACC ACG CAG 
gly arg phe asn asp asp ile thr thr gin 
1741/581 

CTG ACC TCG GCG TTG ACC ATC GAT GAC CGC 
leu thr ser ala leu thr ile asp asp arg 
1801/601 

CCA CTA CGC GAG GAC ATG TTG CGC GCG CTG 
pro leu arg glu asp met leu arg ala leu 
1861/621 

GGG CTG GCC CAG CAG CGG CTG GCC GTC GTT 
gly leu ala gin gin arg leu ala val val 
1921/641 

GTG ACC ATC GTC AAC CCG GGC GGC TCC TAC 
val thr ile val asn pro gly gly ser tyr 
1981/661 

TTG GCG CTG CAT AAT GGC CTC GCC GTG CCA 
leu ala leu his asn gly leu ala val pro 
2041/681 

CCC GGG ATG ACG GTG GCC GAT GTC GGT CAG 
pro gly met thr val ala asp val gly gin 
2101/701 

CGA GTA CCA ATC GAG GTG AAC TTC ACA CAG 
arg val pro ile glu val asn phe thr gin 
2161/721 

CCC GAC GGC GTC GCG CTG GGT GAA CCG GTG 
pro asp gly val ala leu gly glu pro val 
2221/741 

AAG GTG TTG TTC GCG ATC ACG CTA TCC GCT 
lys val leu phe ala ile thr leu ser ala 
2281/761 

CGC CTT TGG CAC CGG TTC CGT GGC CAG CCT 
arg leu trp his arg phe arg gly gin pro 
2341/781 

CCT ACC GGC AAA CAC GCC CCG CAG CGC CGT 
"pro™ thr gly" lys "hxs^ala^ pro— gin- arg arg 
2401/801 
CGG GTA TGA 
arg val\ OPA 



1471/491 

TTG GAG CCG AAT GCC GCG CCC CGT ACC CAA 
leu glu pro asn ala ala pro arg thr gin 
1531/511 

GCC AGC GAC GAC GCG CAG GTC ATC CTG ACC 
ala ser asp asp ala gin val ile leu thr 
1591/531 

GCC GTG CCG CGA CCA CTA CCG GCG GTG ATC 
ala val pro arg pro leu pro ala val ile 
1651/551 

GAA CCC CCG GGC GCT TAC AGC GCC GCT CGC 
glu pro pro gly ala tyr ser ala ala arg 
1711/571 

ATC GGC GGG CAG GTT GCC CGG CTA TGG AAG 
ile gly gly gin val ala arg leu trp lys 
1771/591 

ACC GGG CTG ACC GGC GTG CAG TAC ACC GCA 
thr gly leu thr gly val gin tyr thr ala 
1831/611 

AGC CAA TCG CTA CCA CCC GAT ACC CGC AAC 
ser gin ser leu pro pro asp thr arg asn 
1891/631 

GGA AAG ACG ATC GAC GAT CTT TTC GGC GCG 
gly lys thr ile asp asp leu phe gly ala 
1951/651 " ' 

ACT CTG GCC ACC GAG CAC AGT CCG CTG CCG 
thr leu ala thr glu his ser pro leu pro 
2011/671 

ATC CGG GTC CGG CTA CAG GTC GAT GCT CCG 
ile arg val arg leu gin val asp ala pro 
2071/691 

ATC GAG CTA CCG CCC GGG TAC CTG CCG CTA 
ile glu leu pro pro gly tyr leu pro leu 
2131/711 

CGG GTT GCC GTC GAC GTG TCG CTG CGG ACC 
arg val ala val asp val ser leu arg thr 
2191/731 

CGG TTG TCG GTG CAC TCC AAC GCC TAC GGC 
arg leu ser val his ser asn ala tyr gly 
2251/751 

GCG GCC GTG CTG GTA ACG CTG GCG GGC CGG 
ala ala val leu val thr leu ala gly arg 
2311/771 

GAT CGC GCC GAC CTG GAT CGC CCC GAC CTG 
asp arg ala asp leu asp arg pro asp leu 
2371/791 

GCC GTA GCC AGT CGG GAT GAC GAA AAG CAC 
ala val- ala- ser— arg asp-asp glu lys- his 
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ORF according to Cole et al., 1998 (Nature 393 537-544) and containing Rv 3909. 
$ 1/1 31/11 

\aTGA ctc agc acc ggg tca gca caa cgg tcc cgg gcc ggg gcc gtg acc gca ctg caa ctc 

OPA(leu ser thr gly ser ala gin arg ser arg ala gly ala val thr ala leu gin leu 
61/21 *^QirOND:<S>2.| 91/31 

ggc tgg gcc gct ttg gcg cgc gtc acc tca gcg atc ggc gtc gtg gcc ggc ctc ggg atg 

gly trp ala ala leu ala arg val thr ser ala ile gly val val ala gly leu gly met 
. 121/41 151/51 

GCG CTC ACG GTA CCG TCG GCG GCA CCG CAC GCG CTC GCA GGC GAG CCC AGC CCG ACG CCT 
ala leu thr val pro ser ala ala pro his ala leu ala gly glu pro ser pro thr pro 
181/61 211/71 

TTT GTC CAG GTC CGC ATC GAT CAG GTG ACC CCG GAC GTG GTG ACC ACT TCC AGC GAA CCC 
phe val gin val arg ile asp gin val thr pro asp val val thr thr ser ser glu pro 
241/81 271/91 

CAT GTC ACC GTC AGC GGA ACG GTG ACC AAT ACC GGT GAC CGC CCA GTC CGC GAT GTG ATG 
his val thr val ser gly thr val thr asn thr gly asp arg pro val arg asp val met 
301/101 331/111 

GTC CGG CTT GAG CAC GCC GCC GCG GTC ACG TCG TCA ACG GCG TTA CGC ACC TCG CTC GAC 
val arg leu glu his ala ala ala val thr ser ser thr ala leu arg thr ser leu asp 
361/121 391/131 

GGC GGC ACC GAC CAG TAC CAG CCG GCC GCG GAC TTC CTC ACG GTC GCC CCC GAA CTA GAC 
gly gly thr asp gin tyr gin pro ala ala asp phe leu thr val ala pro glu leu asp 
421/141 451/151 
' CGC GGG'CAX GAG GCC GGC TTT ACC CTC TCG GCC CCG CTG CGC TCG CTG ACC AGG CCG TCG 
arg gly gin glu ala gly phe thr leu ser ala pro leu arg ser leu thr arg pro ser 
481/161 511/171 

TTG GCC GTC AAC CAG CCC GGG ATC TAC CCG GTC CTG GTC AAC GTC AAT GGG ACA CCC GAC 
leu ala val asn gin pro gly ile tyr pro val leu val asn val asn gly thr pro asp 
541/181 571/191 

TAC GGT GCG CCT GCG CGG CTC GAC AAT GCG CGG TTC CTG TTG CCC GTG GTC GGA GTG CCA 
tyr gly ala pro ala arg leu asp asn ala arg phe leu leu pro val val gly val pro 
601/201 631/211 

CCC GAC CAG GCC ACC GAC TTC GGC TCC GCT GTT GCA CCA GAA ACG ACG GCG CCG GTC TGG 
pro asp gin ala thr asp phe gly ser ala val ala pro glu thr thr ala pro val trp 
661/221 691/231 

ATC ACC ATG CTG TGG CCG CTG GCC GAC CGG CCC CGG TTG GCC CCC GGG GCA CCC GGT GGC 
ile thr met leu trp pro leu ala asp arg pro arg leu ala pro gly ala pro gly gly 
721/241 751/251 

ACC GTT CCC GTC CGG CTG GTC GAC GAC GAC CTG GCA AAC TCG CTG GCC AAC GGC GGC CGG 
thr val pro val arg leu val asp asp asp leu ala asn ser leu ala asn gly gly arg 
781/261 811/271 

CTG GAC ATC CTC CTG TCG GCG GCC GAG TTC GCC ACC AAC CGG GAA GTC GAC CCC GAC GGC 
leu asp ile leu leu ser ala ala glu phe ala thr asn arg glu val asp pro asp gly 
841/281 871/291 

GCC GTC GGC CGA GCG CTG TGC CTG GCC ATC GAC CCA GAT CTA CTC ATC ACC GTC AAT GCG 
ala val gly arg ala leu cys leu ala ile asp pro asp leu leu ile thr val asn ala 

90-1-A301 - - - — 931/311 - - 

ATG ACC GGC GGC TAC GTC GTG TCC GAC TCG CCC GAC GGG GCC GCT CAA CTA CCG GGC ACC 
met thr gly gly tyr val val ser asp ser pro asp gly ala ala gin leu pro gly thr 
961/321 991/331 

CCG ACC CAC CCG GGC ACC GGC CAG GCC GCC GCA TCC AGC TGG CTG GAT CGA TTG CGG ACG 
pro thr his pro gly thr gly gin ala ala ala ser ser trp leu asp arg leu arg thr 
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1021/341 

CTA GTC CAC CGG ACA TGC GTG ACG CCG CTG 
leu val his arg thr cys val thr pro leu 
1081/361 

CAG CGG GTT AAT GAT CCG AGG CTG AGC GCG 
gin arg val asn asp pro arg leu ser ala 
1141/381 

GAC CGC ATC CTG GAT GTC AGC TCC ACC CGC 
asp arg ile leu asp val ser ser thr arg 
1201/401 

ACC GGC CGG GCG ATC AAC TTG CTC AGC ACC 
thr gly arg ala ile asn leu leu ser thr 
1261/421 

GAT TTT AGC CCC GAG GAA CAG CAG GGT TCG 
asp phe ser pro glu glu gin gin gly ser 
1321/441 

ACC GCG CCC CGG CGG TTG TCC CCG CGG GTG 
thr ala pro arg arg leu ser pro arg val 
1381/461 

GCC GCG CTG GCC GCC GCG GGA ACA AAC CCG 
ala ala leu ala ala ala gly thr asn pro 
1441/481 

TTC GTT CGG ATC GCG CAT GAA TCG ATC ACC 
phe val arg ile ala his glu ser ile thr 
' 1501/501' " 
CTG TGG CGC AGC TTG GAG CCG AAT GCC GCG 
leu trp arg ser^ leu glu pro asn ala ala 
1561/521 

TCG TGG AGC CTG GCC AGC GAC GAC GCG CAG 
ser trp ser leu ala ser asp asp ala gin 
1621/541 

CGG TCT GGC CTG GCC GTG CCG CGA CCA CTA 
arg ser gly leu ala val pro arg pro leu 
1681/561 

ACC GAG CCA CCG GAA CCC CCG GGC GCT TAC 
thr glu pro pro glu pro pro gly ala tyr 
1741/581 

ATC ACC ACG CAG ATC GGC GGG CAG GTT GCC 
ile thr thr gin ile gly gly gin val ala 
1801/601 

ATC GAT GAC CGC ACC GGG CTG ACC GGC GTG 
ile asp asp arg thr gly leu thr gly val 
1861/621 

TTG CGC GCG CTG AGC CAA TCG CTA CCA CCC 
leu arg ala leu ser gin ser leu pro pro 
1921/641 

CTG GCC GTC GTT GGA AAG ACG ATC GAC GAT 
--leu—a-l-a val val gl-y-~l-ys -thr ile asp asp 
1981/661 

GGC GGC TCC TAC ACT CTG GCC ACC GAG CAC 
gly gly ser tyr thr leu ala thr glu his 



1051/351 

CCT TTT GCC CAA GCC GAC CTG GAT GCT TTG 
pro phe ala gin ala asp leu asp ala leu 
1111/371 

ATC GCA ACC ATC AGC CCC GCC GAC ATC GTC 
ile ala thr ile ser pro ala asp ile val 
1171/391 

GGC GCA ACC GTG CTG CCC GAC GGC CCG TTG 
gly ala thr val leu pro asp gly pro leu 
1231/411 

CAC GGC AAC ACG GTT GCC GTC GCG GCC GCC 
his gly asn thr val ala val ala ala ala 
1291/431 

TCC CAG ATC GGC TCC GCG CTC TTA CCC GCT 
ser gin ile gly ser ala leu leu pro ala 
1351/451 

GTA GCG GCG CCG TTT GAT CCC GCG GTC GGG 
val ala ala pro phe asp pro ala val gly 
1411/471 

ACC GTT CCT ACC TAT CTA GAT CCC TCG TTG 
thr val pro thr tyr leu asp pro ser leu 
1471/491 

GCG CGC CGC CAG GAC GCC TTG GGC GCA ATG 
ala arg arg gin asp ala leu gly ala met 
1531/511 ' 

CCC CGT ACC CAA ATC CTG GTG CCG CCG GCG 
pro arg thr gin ile leu val pro pro ala 
1591/531 

GTC ATC CTG ACC GCG CTG GCC ACC GCC ATC 
val ile leu thr ala leu ala thr ala ile 
1651/551 

CCG GCG GTG ATC GCT GAC GCC GCG GCC CGC 
pro ala val ile ala asp ala ala ala arg 
1711/571 

AGC GCC GCT CGC GGC CGG TTC AAT GAC GAC 
ser ala ala arg gly arg phe asn asp asp 
1771/591 

CGG CTA TGG AAG CTG ACC TCG GCG TTG ACC 
arg leu trp lys leu thr ser ala leu thr 
1831/611 

CAG TAC ACC GCA CCA CTA CGC GAG GAC ATG 
gin tyr thr ala pro leu arg glu asp met 
1891/631 

GAT ACC CGC AAC GGG CTG GCC CAG CAG CGG 
asp thr arg asn gly leu ala gin gin arg 
1951/651 

CTT TTC GGC GCG GTG ACC ATC GTC AAC CCG 
-leu phe gly ala val thr ile val asn pro 
2011/671 

AGT CCG CTG CCG TTG GCG CTG CAT AAT GGC 
ser pro leu pro leu ala leu his asn gly 
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2041/681 2071/691 

CTC GCC GTG CCA ATC CGG GTC CGG CTA CAG GTC GAT GCT CCG CCC GGG ATG ACG GTG GCC 
leu ala val pro ile arg val arg leu gin val asp ala pro pro gly met thr val ala 
2101/701 2131/711 

GAT GTC GGT CAG ATC GAG CTA CCG CCC GGG TAC CTG CCG CTA CGA GTA CCA ATC GAG GTG 
asp val gly gin ile glu leu pro pro gly tyr leu pro leu arg val pro ile glu val 
2161/721 2191/731 

AAC TTC ACA CAG CGG GTT GCC GTC GAC GTG TCG CTG CGG ACC CCC GAC GGC GTC GCG CTG 
asn phe thr gin arg val ala val asp val ser leu arg thr pro asp gly val ala leu 
2221/741 2251/751 

GGT GAA CCG GTG CGG TTG TCG GTG CAC TCC AAC GCC TAC GGC AAG GTG TTG TTC GCG ATC 
gly glu'pro val arg leu ser val his ser asn ala tyr gly lys val leu phe ala ile 
2281/761 2311/771 

ACG CTA TCC GCT GCG GCC GTG CTG GTA ACG CTG GCG GGC CGG CGC CTT TGG CAC CGG TTC 
thr leu ser ala ala ala val leu val thr leu ala gly arg arg leu trp his arg phe 
2341/781 2371/791 

CGT GGC CAG CCT GAT CGC GCC GAC CTG GAT CGC CCC GAC CTG CCT ACC GGC AAA CAC GCC 
arg gly gin pro asp arg ala asp leu asp arg pro asp leu pro thr gly lys his ala 
2401/801 2431/811 

CCG CAG CGC CGT GCC GTA GCC AGT CGG GAT GAC GAA AAG CAC CGG GTA TGA 
pro gin arg arg ala val ala ser arg asp asp glu lys his arg val) OPA 

seq id mos: 6^0-602 I 
FIGURE 36E (continued 2) 



\ 1/1 31/11 

WATC CGC GCG TTG GCG TCG CAT CCG AAC ATC GTC GGA GTC AAG GAC GCC AAA GCC GAC CTG 
„>(ile arg ala leu ala ser his pro asn ile val gly val lys asp ala lys ala asp leu 
V 61/21 91/31 

CAC AGC GGC GCC CAA ATC ATG GCC GAC ACC GGA CTG GCC TAC TAT TCC GGC GAC GAC GCG 
his ser gly ala gin ile met ala asp thr gly leu ala tyr tyr ser gly asp asp ala 
121/41 151/51 

CTC AAC CTG CCC TGG CTG GCC ATG GGC GCC ACG GGC TTC ATC AGC GTG ATT GCC CAC CTG 
leu asn leu pro trp leu ala met gly ala thr gly phe ile ser val ile ala his leu 
181/61 211/71 

GCA GCC GGG CAG CTT CGA GAG TTG TTG TCC GCC TTC GGT TCT GGG GAT ATC GCC ACC GCC 

ala ala gly gin leu arg glu leu leu ser ala phe gly ser gly asp ile ala thr ala 

241/81 

CGC AAG ATC 

arg lys ile) 
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!/! 31/11 

GAT CCG CGC GTT GGC GTC GCA TCC GAA CAT CGT CGG AGT CAA GGA CGC CAA AGC CGA CCT 



asp pro arg val gly val ala ser glu his arg arg ser gin gly arg gin ser arg pro 

•61/21 91/31 
GCA CAG CGG CGC CCA AAT CAT GGC CGA CAC CGG ACT GGC CTA CTA TTC CGG CGA CGA CGC 
ala gin arg arg pro asn his gly arg his arg thr gly leu leu phe arg arg arg arg 
121/41 151/51 

GCT CAA CCT GCC CTG GCT GGC CAT GGG CGC CAC GGG CTT CAT CAG CGT GAT TGC CCA CCT 
ala gin pro ala leu ala gly his gly arg his gly leu his gin arg asp cys pro pro 
181/61 211/71 

GGC AGC CGG GCA GCT TCG AGA GTT GTT GTC CGC CTT CGG TTC TGG GGA TAT CGC CAC CGC 
gly ser arg ala ala ser arg val val val arg leu arg phe trp gly tyr arg his. arg 
241/81 

CCG CAA GAT C 
pro gin asp) 

seq id M05>: fca^-^S 
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J? 

* 1/1 31/1 1 

^JTCC GCG CGT TGG CGT CGC ATC CGA ACA TCG TCG GAG TCA AGG ACG CCA AAG CCG ACC TGC 
-Vser ala arg trp arg arg ile arg thr ser ser glu ser arg thr pro lys pro thr cys 

V^61/21 91' 31 

ACA GCG GCG CCC AAA TCA TGG CCG ACA CCG GAC TGG CCT ACT ATT CCG GCG ACG ACG CGC 
thr ala ala pro lys ser trp pro thr pro asp trp pro thr ile pro ala thr thr arg 
121/41 151/51 

TCA ACC TGC CCT GGC TGG CCA TGG GCG CCA CGG GCT TCA TCA GCG TGA TTG CCC ACC TGG 
ser thr cys pro gly trp pro trp ala pro arg ala ser ser ala) OPA (leu pro thr trp 
181/61 211/71 1 HV,<*V3 

CAG CCG GGC AGC TTC GAG AGT TGT TGT CCG CCT TCG GTT CTG GGG ATA TCG CCA CCG CCC 
gin pro gly ser phe glu ser cys cys pro pro ser val leu gly ile ser pro pro pro 
241/81 
GCA AGA TC 
ala arg) 

SEQ ID fUDS'. (o3.<b-G>2-& 
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Coding sequence Rv2753c predicted 
containing Seq 37A 

1/1 

IGTG ACC ACC GTC GGA TTC GAC GTC GCA 
'val thr thr val gly phe asp val ala 
W21 

ACA CCG TTT AGC GGC GAT GGC TCC CTG 
thr pro phe ser gly asp gly ser leu 
121/41 

CTG GTC GAT GAG GGG TGC GAC GGT CTG 
leu val asp gin gly cys asp gly leu 
181/61 

ACC ACC GAC GGG GAG AAA ATC GAG CTG 
thr thr asp gly glu lys ile glu leu 
241/81 

GCC CGT GTT ATC GCC GGT GCC GGC ACC 
ala arg val ile ala gly ala gly thr 
301/101 

GCT TGT GCG GCC GAG GGT GCG CAC GGG 
ala cys ala ala glu gly ala his gly 
361/121 

CCG CAG CGG GGG CTG CAA GCC CAT TTC 
pro gin arg gly leu gin ala his phe 
421/141 

CTG CTC TAT GAC ATC CCG GGG CGG TCG 
leu leu tyr asp ile pro gly arg ser 
481/161 

TTG GCG TCG CAT CCG AAC ATC GTC GGA 
leu ala ser his pro asn ile val gly 
541/181 

GCC CAA ATC ATG GCC GAC ACC GGA CTG 
ala gin ile met ala asp thr gly leu 
601/201 

CCC TGG CTG GCC ATG GGC GCC ACG GGC 
pro trp leu ala met gly ala thr gly 
661/221 

CAG CTT CGA GAG TTG TTG TCC GCC TTC 
gin leu arg glu leu leu ser ala phe 
721/241 

AAC ATT GCG GTC GCC CCG CTG TGC AAC 
asn ile ala val ala pro leu cys asn 
781/261 

AAG GCG GGC TTG CGG CTG CAG GGC ATC 
lys ala gly leu arg leu gin gly ile 
841/281 

GCG ACA CCG GAG CAG ATC GAC GCG TTG 
ala thr pro glu gin ile asp ala leu 



117/185 



Cole et al., 1998 (Nature 393 537-544) 



31/11 

GCG CGC CTA GGA ACC CTG CTG ACC GCG ATG GTG 
ala arg leu cly thr leu leu thr ala met val 
91/31 

GAC ACC GCC ACC GCG GCG CGG CTG GCC AAC CAC 
asp thr ala thr ala ala arg leu ala asn his 
151/51 

GTG GTC TCG GGC ACC ACC GGC GAG TCG CCG ACC 
val val ser gly thr thr gly glu ser pro thr 
211/71 

CTG CGG GCC GTC TTG GAA GCG GTG GGG GAC CGG 
leu arg ala val leu glu ala val gly asp arg 
271/91 

TAT GAC ACC GCG CAC AGC ATC CGG CTG GCC AAG 
tyr asp thr ala his ser ile arg leu ala lys 
331/111 

CTG CTG GTG GTC ACG CCC TAC TAT TCC AAG CCG 
leu leu val val thr pro tyr tyr ser lys pro 
391/131 

ACC GCC GTC GCC GAC GCG ACC GAG CTG CCG ATG 
thr ala val ala asp ala thr glu leu pro met 
451/151 

GCG GTG CCG ATC GAG CCC GAC ACG ATC CGC GCG 
ala val pro ile glu pro asp "thr ile 'arg 'ala 
511/171 

GTC AAG GAC GCC AAA GCC GAC CTG CAC AGC GGC 
val lys asp ala lys ala asp leu his ser gly 
571/191 

GCC TAC TAT TCC GGC GAC GAC GCG CTC AAC CTG 
ala tyr tyr ser gly asp asp ala leu asn leu 
631/211 

TTC ATC AGC GTG ATT GCC CAC CTG GCA GCC GGG 
phe ile ser val ile ala his leu ala ala gly 
691/231 

GGT TCT GGG GAT ATC GCC ACC GCC CGC AAG ATC 
gly ser gly asp ile ala thr ala arg lys ile 
751/251 

GCG ATG AGC CGC CTG GGT GGG GTG ACG TTG TCC 
ala met ser arg leu gly gly val thr leu ser 
811/271 

GAC GTC GGT GAT CCC CGG CTG CCC CAG GTG GCC 
asp val gly asp pro arg leu pro gin val ala 
871/291 

GCC GCC GAC ATG CGC GCG GCC TCG GTG CTT CGG 
ala ala asp met arg ala ala ser val leu arg* 



901/301 

TGA 

OPA 
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ORF according to Cole et al . , 1998 (Nature 393 537-544) containing Rv2753c 

2® 1/1 31/11 
VyTAA GGT GAG CGC CGT GGC CGA GAC CGC GCQ GCT GCG CGT GCA ACT GAT CGC CAA GAC CGA 
OCH (gly glu arg arg gly arg asp arg ala ala ala arg ala thr asp arg gin asp arg 
61/21*^ seQX-b HD\ Cp2>X 91/31 

CTT CTT GGC CCC ACC CGA CGT GCC CTG GAC CAC CGA CGC CGA CGG CGG ACC CGC GCT GGT 
leu leu gly pro thr arg arg ala leu asp his arg arg arg arg arg thr arg ala gly 
121/41 151/51 

CGA GTT CGC CGG CCG GGC CTG CTA TCA GAG CTG GTC CAA GCC CAA TCC CAA GAC CGC CAC 
arg val arg arg pro gly leu leu ser glu leu val gin ala gin ser gin asp arg his 
181/61 211/71 

CAA CGC CGG CTA CCT CCG GCA CAT CAT CGA CGT CGG ACA TTT CTC GGT GCT AGA GCA TGC 
gin arg arg leu pro pro ala his his arg arg arg thr phe leu gly ala arg ala cys 
241/81 271/91 

CAG CGT GTC GTT CTA CAT CAC CGG GAT CTC GCG ATC GTG CAC CCA CGA GCT GAT CCG CCA 
gin arg val val leu his his arg asp leu ala ile val his pro arg ala asp pro pro 
301/101 331/111 

CCG GCA TTT CTC CTA CTC GCA GCT CTC CCA GCG CTA CGT ACC CGA GAA GGA CTC GCG GGT 
pro ala phe leu leu leu ala ala leu pro ala leu arg thr arg glu gly leu ala gly 
361/121 391/131 

CGT CGT GCC GCC CGG CAT GGA GGA CGA CGC CGA CCT GCG CCA CAT CCT GAC CGA GGC CGC 
arg arg ala ala arg his gly gly arg arg arg pro ala pro his pro asp arg gly arg 
421/141 451/151 

CGA CGC CGC CCG CGC CAC CTA CAG CGA GCT GCT GGC CAA GCT GGA AGC CAA GTT CGC CGA 
1 arg arg- arg pro arg his leu gin arg ala ala gly- gin ala gly ser gin -val. arg arg 
481/161 511/171 

CCA ACC CAA CGC GAT CCT GCG CCG CAA GCA GGC CCG CCA AGC CGC CCG CGC GGT GCT GCC 
pro thr gin arg asp pro ala pro gin ala gly pro pro ser arg pro arg gly ala ala 
541/181 571/191 

CAA CGC CAC CGA AAC CCG CAT CGT GGT GAC CGG CAA CTA CCG GGC CTG GCG GCA CTT CAT 
gin arg his arg asn pro his arg gly asp arg gin leu pro gly leu ala ala leu his 
601/201 631/211 

CGC AAT GCG GGC CAG CGA GCA CGC CGA CGT GGA AAT CCG GCG ACT GGC CAT CGA ATG CCT 
arg asn ala gly gin arg ala arg arg arg gly asn pro ala thr gly his arg met pro 
661/221 691/231 

GCG CCA GCT CGC CGC CGT GGC CCC CGC GGT GTT CGC CGA CTT CGA GGT GAC CAC CCT GGC 
ala pro ala arg arg arg gly pro arg gly val arg arg leu arg gly asp his pro gly 
721/241 751/251 

CGA CGG CAC CGA GGT GGC GAC CAG CCC GTT GGC GAC CGA AGC CTG AGG CGG CGT GTC GCT 
arg arg his arg gly gly asp gin pro val gly asp arg ser leu arg arg arg val ala 
781/261 811/271 

GGA CAA ACA CGC GCG CTC GCG GCC GGG ATA AAG CGC CAG GTA ACC TTG GGA GCC GTG ACC 
gly gin thr arg ala leu ala ala gly ile lys arg gin val thr leu gly ala val thr 
841/281 871/291 

ACC GTC GGA TTC GAC GTC GCA GCG CGC CTA GGA ACC CTG CTG ACC GCG ATG GTG ACA CCG 
thr val gly phe asp val ala ala arg leu gly thr leu leu thr ala met val thr pro 
901/301 931/311 
--.XTT_AGC GGC GAT GGC TCC CTG GAC ACC GCC ACC GCG GCG CGG CTG GC C AA C CAC CTG GTC 
phe ser gly asp gly ser leu asp thr ala thr ala ala arg leu ala asn his leu val 
961/321 991/331 

GAT CAG GGG TGC GAC GGT CTG GTG GTC TCG GGC ACC ACC GGC GAG TCG CCG ACC ACC ACC 
asp gin gly cys asp gly leu val val ser gly thr thr gly glu ser pro thr thr thr 



seq id Nb5: ^3 1 
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1021/341 

GAC GGG GAG AAA ATC 
asp gly glu lys ile 
1081/361 

GTT ATC GCC GGT GCC 
val ile ala gly ala 
1141/381 

GCG GCC GAG GGT GCG 
ala ala glu gly ala 
1201/401 

CGG GGG CTG CAA GCC 
arg gly leu gin ala 
1261/421 

TAT GAC ATC CCG GGG 
tyr asp ile pro gly 
1321/441 

TCG CAT CCG AAC ATC 
ser his pro asn ile 
1381/461 

ATC ATG GCC GAC ACC 
ile met ala asp thr 
1441/481 

CTG GCC ATG GGC GCC 
leu ala met gly ala 
1501/501 

CGA GAG TTG TTG TCC 
arg glu leu leu ser 
1561/521 

GCG GTC GCC CCG CTG 
ala val ala pro leu 
1621/541 

GGC TTG CGG CTG CAG 
gly leu arg leu gin 
1681/561 

CCG GAG CAG ATC GAC 
pro glu gin ile asp 



GAG CTG CTG CGG GCC 
glu leu leu arg ala 

GGC ACC TAT GAC ACC 
gly thr tyr asp thr 

CAC GGG CTG CTG GTG 
his gly leu leu val 

CAT TTC ACC GCC GTC 
his phe thr ala val 

CGG TCG GCG GTG CCG 
arg ser ala val pro 

GTC GGA GTC AAG GAC 
val gly val lys asp 

GGA CTG GCC TAC JTAT 
gly leu ala tyr tyr 

ACG GGC TTC ATC AGC 
thr gly phe ile ser 

GCC TTC GGT TCT GGG 
ala phe gly ser gly 

TGC AAC GCG ATG AGC 
cys asn ala met ser 

GGC ATC GAC GTC GGT 
gly ile asp val gly 

GCG TTG GCC GCC GAC 
ala leu ala ala asp 



1051/351 

GTC TTG GAA GCG GTG 
val leu glu ala val 
1111/371 

GCG CAC AGC ATC CGG 
ala his ser ile arg 
1171/391 

GTC ACG CCC TAC TAT 
val thr pro tyr tyr 
1231/411 

GCC GAC GCG ACC GAG 
ala asp ala thr glu 
1291/431 

ATC GAG CCC GAC ACG 
ile glu pro asp thr 
1351/451 

GCC AAA GCC GAC CTG 
ala lys ala asp leu 
1411/471 

TCC GGC GAC GAC GCG 
ser gly asp asp ala 
1471/491 

GTG ATT GCC CAC CTG 
val ile ala his leu 
1531/511 

GAT ATC GCC ACC GCC 
asp ile ala thr ala 
1591/531 

CGC CTG GGT GGG GTG 
arg leu gly gly val 
1651/551 

GAT CCC CGG CTG CCC 
asp pro arg leu pro 
1711/571 

ATG CGC GCG GCC TCG 
met arg ala ala ser 



GGG GAC CGG GCC CGT 
gly asp arg ala arg 

CTG GCC AAG GCT TGT 
leu ala lys ala cys 

TCC AAG CCG CCG CAG 
ser lys pro pro gin 

CTG CCG ATG CTG CTC 
leu pro met leu leu 

ATC CGC GCG TTG GCG 
ile arg ala leu ala 

CAC AGC GGC GCC CAA 
his ser gly ala gin 

CTC AAC CTG CCC TGG 
leu asn leu pro trp 

GCA GCC GGG CAG CTT 
ala ala gly gin leu 

CGC AAG ATC AAC ATT 
arg lys ile asn ile 

ACG TTG TCC AAG GCG 
thr leu ser lys ala 

CAG GTG GCC GCG ACA 
gin val ala ala thr 

GTG CTT CGG TGA 
val leu argJOPA 



SEQ ID hlDS: (o^-kZ'X 

FIGURE 37E (continued 1) 



^0* 120/185 

1/1 31/11 
V^J GCG GTG AAC TGG TGG GCC CGG ATG GTT CAA GTA CGC CGT CGC AAA CTC GAG CAC AAC AGG 
2/ala val asn trp trp ala arg met val gin val arg arg arg lys leu glu his asn arg 
61/21 91/31 
y AGA CGA CGG ATG GAA GGA GAT GCT GGC GCC GGC CAG CTG AAC CCT GCC GAT GCG AAT AAG 

arg arg arg met glu gly asp ala gly ala gly gin leu asn pro ala asp ala asn lys 
121/41 151/51 

TCG TCG TCT ACG GAG GTG AAG GCG GCG GAT TCG GCG GAA TCT GAC GCC GGA GCC GAC CAG 
ser ser ser thr glu val lys ala ala asp ser ala glu ser asp ala gly ala asp gin 
181/61 211/71 

ACT GGC CCG CAG GTG AAG GCG GCG GAT TCG GCG GAA TCT GAC GCC GGA GAG CTC GGC GAG 
thr gly pro gin val lys ala ala asp ser ala glu ser asp ala gly glu leu gly glu 
241/81 271/91 

GAC GCG TGC CCA GAA CAG GCC CTC GTC GAG CGG CGC CCG TCG CGG TTG CGG CGA GGC TGG 
asp ala cys pro glu gin ala leu val glu arg arg pro ser arg leu arg arg gly trp 
301/101 331/111 

CTT GTT GGC ATT GCG GCG ACG CTG CTC GCG TTG GCC GGT GGC CTT GGC GCA GCG GGT TAT 
leu val gly ile ala ala thr leu leu ala leu ala gly gly leu gly ala ala gly tyr 
361/121 391/131 

TTT GCG TTG CGC TCA CAC CAG GAA AGC CAA TCA ATC GCG CGC GAG GAC CTT GCG GCC ATT 
phe ala leu arg ser his gin glu ser gin ser ile ala arg glu asp leu ala ala ile 
421/141 451/151 

GAG GCC GCT AAG GAT TGC GTT GCG GCC ACG CAG GCA CCC GAT GCT GGG GCG ATG TCG GCT 
glu ala ala lys asp cys val ala ala thr gin ala pro asp ala gly ala met ser ala 
481/161 

AGC ATG CAG AAG ATC 
ser met gin lys ile) 

SEQ id 633-63^ 



9 gQ 



FIGURE 38A 



31/11 

^/CAG CGG TGA ACT GGT GGG CCC GGA TGG TTC AAG TAC GCC GTC GCA AAC TCG AGC ACA ACA 
gin arg OPA^thr gly gly pro gly trp phe lys tyr ala val ala asn ser ser thr thr 
61/21 ^^eftW? A/o:(p>(tf 91/31 

GGA GAC GAC GGA TGG AAG GAG ATG CTG GCG CCG GCC AGC TGA ACC CTG CCG ATG CGA ATA 
gly asp asp gly trp lys glu met leu ala pro ala ser^OPA^thr leu pro met arg ile 
121/41 151/51 / *\&SQXt> NO\<*2l 

AGT CGT CGT CTA CGG AGG TGA AGG CGG CGG ATT CGG CGG AAT CTG ACG CCG GAG CCG ACC 
ser arg arg leu arg arg)oPA(arg arg arg ile arg arg asn leu thr pro glu pro thr 
i8i/6i ^ecix&ucr (j&Z^r 211/71 

AGA CTG GCC CGC AGG TGA AGG CGG CGG ATT CGG CGG AAT CTG ACG CCG GAG AGC TCG GCG 
arg leu ala arg arg jOPA ( ar^g arg arg ile arg arg asn leu thr pro glu ser ser ala 

241/81 s^a^wc^ury^ 271/91 

AGG ACG CGT GCC CAG AAC AGG CCC TCG TCG AGC GGC GCC CGT CGC GGT TGC GGC GAG GCT 
arg thr arg ala gin asn arg pro ser ser ser gly ala arg arg gly cys gly glu ala 
301/101 331/111 

GGC TTG TTG GCA TTG CGG CGA CGC TGC TCG CGT TGG CCG GTG GCC TTG GCG CAG CGG GTT 
gly leu leu ala leu arg arg arg cys ser arg trp pro val ala leu ala gin arg val 
361/121 391/131 
_ATT™T,TG. CGT TGC GCT CAC ACC AGG A AA GCC AAT CAA TCG CG C GCG A G G ACC TTG CGG CCA 
ile leu arg cys ala his thr arg iys ala asn gin ser arg ala arg thr leu arg pro 
421/141 451/151 

TTG AGG CCG CTA AGG ATT GCG TTG CGG CCA CGC AGG CAC CCG ATG CTG GGG CGA TGT CGG 
leu arg pro leu arg ile ala leu arg pro arg arg his pro met leu gly arg cys arg 
481/161 

CTA GCA TGC AGA AGA TC 
leu ala cys arg ara) 

SEQ ID /Vft5;&35-&2> < 3 



FIGURE 38B 



121/185 



1/1 31/11 
S^AGC GGT GAA CTG GTG GGC CCG GAT GGT TCA AGT ACG CCG TCG CAA ACT CGA GCA CAA CAG 
^•Vser gly glu leu val gly pro asp gly ser ser thr pro ser gin thr arg ala gin gin 
S>1/21 91/31 
GAG ACG ACG GAT GGA AGG AGA TGC TGG CGC CGG CCA GCT GAA CCC TGC CGA TGC GAA TAA 
glu thr thr asp gly arg arg cys trp arg arg pro ala glu pro cys arg cys glu) OCH 
121/41 151/51 

GTC GTC GTC TAC GGA GGT GAA GGC GGC GGA TTC GGC GGA ATC TGA CGC CGG AGC CGA CCA 
ul&f val val val tyr gly gly glu gly gly gly phe gly gly ile\OPA(arg arg ser arg pro 
S81/61 211/71 ' ^^WKM3- 

GAC TGG CCC GCA GGT GAA GGC GGC GGA TTC GGC GGA ATC TGA CGC CGG AGA GCT CGG CGA 
asp trp pro ala gly glu gly gly gly phe gly gly ile) OPA (arg arg arg ala arg arg 
241/81 271/91 ^seQjZbXO: fcl^q 

GGA CGC GTG CCC AGA ACA GGC CCT CGT CGA GCG GCG CCC GTC GCG GTT GCG GCG AGG CTG 
gly arg val pro arg thr gly pro arg arg ala ala pro val ala val ala ala arg leu 
301/101 331/111 

GCT TGT TGG CAT TGC GGC GAC GCT GCT CGC GTT GGC CGG TGG CCT TGG CGC AGC GGG TTA 
ala cys trp his cys gly asp ala ala arg val gly arg trp pro trp arg ser gly leu 
361/121 391/131 

TTT TGC GTT GCG CTC ACA CCA GGA AAG CCA ATC AAT CGC GCG CGA GGA CCT TGC GGC CAT 
phe cys val ala leu thr pro gly lys pro ile asn arg ala arg gly pro cys gly his) 
421/141 451/151 

TGA GGC CGC TAA GGA TTG CGT TGC GGC CAC GCA GGC ACC CGA TGC TGG GGC GAT ..GTC GGC 
OPA gly arg OCH (gly leu arg cys gly his ala gly thr arg cys trp gly asp val gly) 
481/161 v fe S^^towO'.toqS 

TAG CAT GCA GAA GAT C 
AMB (his ala glu as 



seq id Hps: Ctfo-(s>'-i(o 

FIGURE 38C 



/> . 122/185 

Sequence Rv0175 predicted by Cole et al . , 1998 (Nature 393 537-544) and 
containing seq38A 

1/1 31/11 
\AGTG AAG GCG GCG GAT TCG GCG GAA TCT GAC GCC GGA GCC GAC CAG ACT GGC CCG CAG GTG 
{val lys ala ala asp ser ala glu ser asp ala gly ala asp gin thr gly pro gin val 
^61/21 91/31 

AAG GCG GCG GAT TCG GCG GAA TCT GAC GCC GGA GAG CTC GGC GAG GAC GCG TGC CCA GAA 

lys ala ala asp ser ala glu ser asp ala gly glu leu gly glu asp ala cys pro glu 

121/41 151/51 

CAG GCC CTC GTC GAG CGG CGC CCG TCG CGG TTG CGG CGA GGC TGG CTT GTT GGC ATT GCG 
gin ala leu val glu arg arg pro ser arg leu arg arg gly trp leu val gly ile ala 
181/61 211/71 

GCG ACG CTG CTC GCG TTG GCC GGT GGC CTT GGC GCA GCG GGT TAT TTT GCG TTG CGC TCA 
ala thr leu leu ala leu ala gly gly leu gly ala ala gly tyr phe ala leu arg ser 
241/81 271/91 

CAC CAG GAA AGC CAA TCA ATC GCG CGC GAG GAC CTT GCG GCC ATT GAG GCC GCT AAG GAT 
his gin glu ser gin ser ile ala arg glu asp leu ala ala ile glu ala ala lys asp 
301/101 331/111 

TGC GTT GCG GCC ACG CAG GCA CCC GAT GCT GGG GCG ATG TCG GCT AGC ATG CAG AAG ATC 
cys val ala ala thr gin ala pro asp ala gly ala met ser ala ser met gin lys ile 
361/121 391/131 

ATC GAG TGT GGC ACC GGT GAT TTC GGT GCC CAG GCG TCG TTG TAC ACC AGC ATG CTC GTC 
ile glu cys gly thr gly asp phe gly ala gin ala ser leu tyr thr ser met leu val 
421/141 451/151 

GAG GCG TAT CAA GCG GCC AGC GTC CAC GTG CAA GTG ACC GAT ATG CGC GCG GCG GTC GAG 
glu' ala 'tyr' glri ala ala ser val his val gin val thr asp met arg ala ala-val glu 
481/161 511/171 

CGC AAC AAC AAT GAC GGG TCG GTC GAT GTT CTG GTG GCG CTC CGG GTC AAG GTG TCC AAC 
arg asn asn asn asp gly ser val asp val leu val ala leu arg val lys val ser asn 
541/181 571/191 

ACC GAC TCG GAT GCC CAT GAA GTC GGC TAC CGT CTT CGG GTC CGG ATG GCA. CTG GAT GAG 

thr asp ser asp ala his glu val gly tyr arg leu arg val arg met ala leu asp glu 

601/201 631/211 

GGC CGC TAT AAG ATC GCC AAA CTC GAC CAG GTG ACG AAG TGA 

gly arg tyr lys ile ala lys leu asp gin val thr lys^OPA 



seq id /mds: 

FIGURE 38D 



123/185 



VoRF according to Cole et al., 1998 (Nature 393 537-544) containing Rv0175 



h 1/1 31/11 
V(TGA ACT GGT GGG GCC GGA TGG TGT CAA GTA CGC CGT CGC AAA CTC GAG CAC AAC AGG AGA 
OPA(thr gly gly ala gly trp cys gin val arg arg arg lys leu glu his asn arg arg 
61/21 ^SEfrxbNO^bO 91/31 

CGA CGG ATG GAA GGA GAT GCT GGC GCC GGC CAG CTG AAC CCT GCC GAT GCG AAT AAG TCG 
arg arg met glu gly asp ala gly ala gly gin leu asn pro ala asp ala asn lys ser 
121/41 151/51 

TCG TCT ACG GAG GTG AAG GCG GCG GAT TCG GCG GAA TCT GAC GCC GGA GCC GAC CAG ACT 
ser ser thr glu val lys ala ala asp ser ala glu ser asp ala gly ala asp gin thr 
181/61 211/71 

GGC CCG CAG GTG AAG GCG GCG GAT TCG GCG GAA TCT GAC GCC GGA GAG CTC GGC GAG GAC 
gly pro gin val lys ala ala asp ser ala glu ser asp ala gly glu leu gly glu asp 
241/81 271/91 

GCG TGC CCA GAA CAG GCC CTC GTC GAG CGG CGC CCG TCG CGG TTG CGG CGA GGC TGG CTT 
ala cys pro glu gin ala leu val glu arg arg pro ser arg leu arg arg gly trp leu 
301/101 331/111 

GTT GGC ATT GCG GCG ACG CTG CTC GCG TTG GCC GGT GGC CTT GGC GCA GCG GGT TAT TTT 
val gly ile ala ala thr leu leu ala leu ala gly gly leu gly ala ala gly tyr phe 
361/121 391/131 

GCG TTG CGC TCA CAC CAG GAA AGC CAA TCA ATC GCG CGC GAG GAC CTT GCG GCC ATT GAG 
ala leu arg ser his gin glu ser gin ser ile ala arg glu asp leu ala ala ile glu 
421/141 451/151 
1 GCC ' GCT ' AAG' 1 GAT TGC GTT GCG GCC ACG CAG GCA GCC GAT GCT GGG GCG ATG 'TCG GCT AGC 
ala ala lys asp cys val ala ala thr gin ala pro asp ala gly ala met ser ala ser 
481/161 511/171 

ATG CAG AAG ATC ATC GAG TGT GGC ACC GGT GAT TTC GGT GCC CAG GCG TCG TTG TAC ACC 
met gin lys ile ile glu cys gly thr gly asp phe gly ala gin ala ser leu tyr thr 
541/181 571/191 

AGC ATG CTC GTC GAG GCG TAT CAA GCG GCC AGC GTC CAC GTG CAA GTG ACC GAT ATG CGC 
ser met leu val glu ala tyr gin ala ala ser val his val gin val thr asp met arg 
601/201 631/211 

GCG GCG GTC GAG CGC AAC AAC AAT GAC GGG TCG GTC GAT GTT CTG GTG GCG CTC CGG GTC 
ala ala val glu arg asn asn asn asp gly ser val asp val leu val ala leu arg val 
661/221 691/231 

AAG GTG TCC AAC ACC GAC TCG GAT GCC CAT GAA GTC GGC TAC CGT CTT CGG GTC CGG ATG 
lys val ser asn thr asp ser asp ala his glu val gly tyr arg leu arg val arg met 
721/241 751/251 

GCA CTG GAT GAG GGC CGC TAT AAG ATC GCC AAA CTC GAC CAG GTG ACG AAG TGA 
ala leu asp glu gly arg tyr lys ile ala lys leu asp gin val thr lys^OPA 



SEQ ID NOS; (oM<?-&So 
FIGURE 38E 



124/185 



4^1/1 31/11 

<VACA CCT CCC CCC CCG CCG CCG CTG CCG CCG GTT CCC TTT CCC AAG GAA TGT CCG GCG CCG 

5ithr pro pro pro pro pro pro leu pro pro val pro phe pro lys glu cys pro ala pro 

Cfr 61/21 91/31 

-^V*^ GGC GTG ATG CAA GGC TGC CTT GAG AGC ACC AGC GGC TTG ATC ATG GGC ATC GAC'AGC AAG 

gl y val met gin gly cys leu glu ser thr ser gly leu ile met gly ile asp ser lys 

121/41 151/51 

ACC GCA CTG GTC GCC GAG CGC ATC ACC GGT GCC GTC GAG GAG ATC 

thr ala leu val ala glu arg ile thr gly ala val glu glu ile} 

SEQ ID ND5;&S/-&Sa 
FIGURE 39A 

<^1/1 31/11 
^ CAC CTC CCC CCC CGC CGC CGC TGC CGC CGG TTC CCT TTC CCA AGG AAT GTC CGG CGC CGG 
/his leu pro pro arg arg arg cys arg arg phe pro phe pro arg asn val arg arg arg 
./**61/21 91/31 
' -'■ GCG TGA TGC AAG GCT GCC TTG AGA GCA CCA GCG GCT TGA TCA TGG GCA TCG ACA . GCA AGA 

ala^OPACcys lys ala ala leu arg ala pro ala ala)OPA(ser trp ala ser thr ala arg 
^ 121/41 *~$&Qtt>HC>\G>SS 151/51 ^S*QXb*D'.£>S$> 

fy* CCG CAC TGG TCG CCG AGC GCA TCA CCG GTG CCG TCG AGG AGA TC 

^ pro his trp ser pro ser ala ser pro val pro ser arg arg} 

SEQ ID NDS'.fe53-656 

FIGURE 39B 

V 5 !/! 31/11 

<VGAC ACC TCC CCC CCC GCC GCC GCT GCC GCC GGT TCC CTT TCC CAA GGA ATG TCC GGC GCC 

.(asp thr ser pro pro ala ala ala ala ala gly ser leu ser gin gly met ser gly ala 
~/*\l/21 91/31 

GGG CGT GAT GCA AGG CTG CCT TGA GAG CAC CAG CGG CTT GAT CAT GGG CAT CGA CAG CAA 

gly arg asp ala arg leu pro) OPA ^j^u his gin arg leu asp his gly his arg gin gin 
^ 121/41 ^©XbNO^SCp 151/51 

_ GAC CGC ACT GGT CGC CGA GCG CAT CAC CGG TGC CGT CGA GGA GAT C 

y ~ al>p arg thr gly arg arg^llTliTs"his arg "cys arg "arg gly asp} 



SEQ ID ND3:&S7-<oS c i 

FIGURE 39C 



^ 125/185 

/ 

^Coding sequence Rv3006 predicted by Cole et al., 1998 (Nature 393 537-544) and containing 
H seq39A 

1/1 31/11 
O ATG TGG ACA ACG CGG TTG GTT CGA TCC GGA CTC GCC GCG CTG TGC GCG GCA GTG CTG GTA 
-(Met trp thr thr arg leu val arg ser gly leu ala ala leu cys ala ala val leu val 
./61/21 91/31 
.VJT TCG AGC GGC TGC GCA CGG TTC AAC GAC GCT CAA TCT CAG CCG TTC ACC ACC GAA CCG GAG 
^O* ser S er gly cys ala arg phe asn asp ala gin ser gin pro phe thr thr glu pro glu 

j£> 121/41 151/51 

g©* CTG CGG CCC CAA CCC AGC TCG ACA CCT CCC CCC CCG CCG CCG CTG CCG CCG GTT CCC TTT 

* . leu arg pro gin pro ser ser thr pro pro pro pro pro pro leu pro pro val pro phe 

181/61 211/71 

CCC AAG GAA TGT CCG GCG CCG GGC GTG ATG CAA GGC TGC CTT GAG AGC ACC AGC GGC TTG 
pro lys glu cys pro ala pro gly val met gin gly cys leu glu ser thr ser gly leu 
241/81 271/91 

ATC ATG GGC ATC GAC AGC AAG ACC GCA CTG GTC GCC GAG CGC ATC ACC GGT GCC GTC GAG 
ile met gly ile asp ser lys thr ala leu val ala glu arg ile thr gly ala val glu 
301/101 331/111 

GAG ATC TCT ATC AGC GCC GAG CCG AAG GTA AAG ACG GTC ATC CCC GTG GAT CCT GCC GGT 
glu ile ser ile ser ala glu pro lys val lys thr val ile pro val asp pro ala gly 
361/121 391/131 

GAC GGT GGC TTG ATG GAC ATT GTG CTG TCG CCC ACC TAC TCG CAA GAC CGG CTG ATG TAC 
asp gly gly leu met asp ile val leu ser pro thr tyr ser gin asp arg leu met tyr 
421/141 451/151 • 

GCC TAC ATC AGC ACG CCC ACC GAC AAC CGG GTG GTG CGA GTG GCC GAC GGC GAC ATC CCC ' 
ala tyr ile ser thr pro thr asp asn arg val val arg val ala asp gly asp ile pro 
481/161 511/171 

. AAG, GAC, ATC,, CTG ,ACC .GGC ATC CCC AAA GGT GCT GCC GGT AAC ACC GGG GCG CTG ATC TTC 

lys asp ile leu thr gly ile pro lys gly ala ala giy asn thr gly ala leu ile phe f 
541/181 571/191 

ACC AGT CCC ACC ACG CTG GTC GTG ATG ACC GGG GAT GCT GGC GAC CCG GCG TTG GCC GCC 
thr ser pro thr thr leu val val met thr gly asp ala gly asp pro ala leu ala ala 
601/201 631/211 

GAT CCC CAA TCG TTG GCC GGT AAG GTC CTG CGT ATC GAA CAG CCC ACC ACC ATC GGC CAG 
asp pro gin ser leu ala gly lys val leu arg ile glu gin pro thr thr ile gly gin 
661/221 691/231 

ACG CCG CCG ACG ACG GCG CTG TCT GGC ATC GGC TCC GGC GGC GGC TTG TGC ATC GAT CCG 
thr pro pro thr thr ala leu ser gly ile gly ser gly gly gly leu cys ile asp pro 
721/241 751/251 

GTC GAC GGC TCG CTA TAT GTC GCC GAC CGC ACG CCA ACG GCG GAC CGA TTG CAG CGC ATC 
val asp gly ser leu tyr val ala asp arg thr pro thr ala asp arg leu gin arg ile 
781/261 811/271 

ACC AAG AAC TCG GAG GTC TCT ACG GTA TGG ACC TGG CCG GAC AAG CCC GGC GTG GCC GGG 
thr lys asn ser glu val ser thr val trp thr trp pro asp lys pro gly val ala gly 
841/281 871/291 

TGT GCC GCG ATG GAC GGC ACC GTG CTG GTC AAC CTG ATT AAT ACC AAA CTG ACG GTG GCG 
cys ala ala met asp gly thr val leu val asn leu ile asn thr lys leu thr val ala 
901/301 931/311 

GTC CGG CTC GCG CCG TCG ACC GGT GCG GTC ACC GGA GAA CCC GAC GTT GTC CGC AAA GAC 
val arg leu ala pro ser thr gly ala val thr gly glu pro asp val val arg lys asp 
961/321 991/331 

ACT CAT GCG CAT GCG TGG GCA TTA CGG ATG TCG CCG GAC GGC AAC GTC TGG GGA GCC ACC 
thr his ala his ala trp ala leu arg met ser pro asp gly asn val trp gly ala thr 

: 1G21 y- 3 41 10517-351- 



GTC AAC AAG ACC GCC GGC GAC GCC GAG AAG CTC GAC GAT GTG GTG TTC CCG CTG TTC CCG 
val asn lys thr ala gly asp ala glu lys leu asp asp val val phe pro leu phe pro 
1081/361 1111/371 
CAG GGT GGC GGC TTC CCG CGC AAC AAC GAC GAC AAG ACC TGA 
gin gly gly gly phe pro arg asn asn asp asp lys thrjOPA 

SEQ ID hlOS\ CokD-kk I 



FIGURE 39D 



126/185 



0» 



S ORF according to Cole et al . , 1998 (Nature 393 537-544) and containing Rv3006 

# 1/1 31/11 
Vs^TAA GGC CAT TTA GTG CCG AAT TGG GGA TTT GAG CGG CGC TTT CGC CAG ACA ATC CGC ACA 
OCH(gly his leu val pro asn trp gly phe glu arg arg phe arg gin thr ile arg thr 
61/2lC^6"Q ;^b/SfO:<pfc3 91/31 

TTG ACC CTG ACC AGC CCA CCA AAA GGC CCC AAT TGG GCC GCC ATG CCG ACA GTG CGC ACC 
leu thr leu thr ser pro pro lys gly pro asn trp ala ala met pro thr val arg thr 
121/41 151/51 

CCG GCA GGT GGC GGC GAT GCC CAC AAT GTC CGT AGC CTG TCG GTC ATG TGG ACA ACG CGG 
pro ala gly gly gly asp ala his asn val arg ser leu ser val met trp thr thr , arg 
181/61 211/71 

TTG GTT CGA TCC GGA CTC GCC GCG CTG TGC GCG GCA GTG CTG GTA TCG AGC GGC TGC GCA 
leu val arg ser gly leu ala ala leu cys ala ala val leu val ser ser gly cys ala 
241/81 271/91 

CGG TTC AAC GAC GCT CAA TCT CAG CCG TTC ACC ACC GAA CCG GAG CTG CGG CCC CAA CCC 
arg phe asn asp ala gin ser gin pro phe thr thr glu pro glu leu arg pro gin pro 
301/101 331/111 

AGC TCG ACA CCT CCC CCC CCG CCG CCG CTG CCG CCG GTT CCC TTT CCC AAG GAA TGT CCG 
ser ser thr pro pro pro pro pro pro leu pro pro val pro phe pro lys glu cys pro 
361/121 391/131 * - 

GCG CCG GGC GTG ATG CAA GGC TGC CTT GAG AGC ACC AGC GGC TTG ATC ATG GGC ATC GAC 
ala pro gly val met gin gly cys leu glu ser thr ser gly leu ile met gly ile asp 

421/141 . , 451/151 ,. , . 

AGC AAG ACC GCA CTG GTC GCC GAG CGC ATC ACC GGT GCC GTC GAG GAG ATC TCT ATC AGC 
ser lys thr ala leu val ala glu arg ile thr gly ala val glu glu ile ser ile ser 
481/161 511/171 

GCC GAG CCG AAG GTA AAG ACG GTC ATC CCC GTG GAT CCT GCC GGT GAC GGT GGC TTG ATG 
ala glu pro lys val lys thr val ile pro val asp pro ala gly asp gly gly leu met 
541/181 571/191 

GAC ATT GTG CTG TCG CCC ACC TAG TCG CAA GAC CGG CTG ATG TAC GCC TAC ATC AGC ACG 
asp ile val leu ser pro thr tyr ser gin asp arg leu met tyr ala tyr ile ser thr 
601/201 631/211 

CCC ACC GAC AAC CGG GTG GTG CGA GTG GCC GAC GGC GAC ATC CCC AAG GAC ATC CTG ACC 
pro thr asp asn arg val val arg val ala asp gly asp ile pro lys asp ile leu thr 
661/221 691/231 

GGC ATC CCC AAA GGT GCT GCC GGT AAC ACC GGG GCG CTG ATC TTC ACC AGT CCC ACC ACG 
gly ile pro lys gly ala ala gly asn thr gly ala leu ile phe thr ser pro thr thr 
721/241 751/251 

CTG GTC GTG ATG ACC GGG GAT GCT GGC GAC CCG GCG TTG GCC GCC GAT CCC CAA TCG TTG 
leu val val met thr gly asp ala gly asp pro ala leu ala ala asp pro gin ser leu 
781/261 811/271 

GCC GGT AAG GTC CTG CGT ATC GAA CAG CCC ACC ACC ATC GGC CAG ACG CCG CCG ACG ACG 
ala gly lys val leu arg ile glu gin pro thr thr ile gly gin thr pro pro thr thr 
841/281 871/291 

GCG CTG TCT GGC ATC GGC TCC GGC GGC GGC TTG TGC ATC GAT CCG GTC GAC GGC TCG CTA 
ala leu ser gl y ile g ly ser g ly gl y gly leu cys ile asp pro val asp gly s er leu 
901/301 931/311 

TAT GTC GCC GAC CGC ACG CCA ACG GCG GAC CGA TTG CAG CGC ATC ACC AAG AAC TCG GAG 
tyr val ala asp arg thr pro thr ala asp arg leu gin arg ile thr lys asn ser glu 



seq id NDS: G>6>2-6>6>2> 



FIGURE 39E 



127/185 



961/321 

GTC TCT ACG GTA TGG 
val ser thr val trp 
1021/341 

GGC ACC GTG CTG GTC 
gly thr val leu val 
1081/361 

TCG ACC GGT GCG GTC 
ser thr gly ala val 
1141/381 

TGG GCA TTA CGG ATG 
trp ala leu arg met 
1201/401 

GGC GAC GCC GAG AAG 
gly asp ala glu lys 
1261/421 

CCG CGC AAC AAC GAC 
pro arg asn asn asp 



ACC TGG CCG GAC AAG 
thr trp pro asp lys 

AAC CTG ATT AAT ACC 
asn leu ile asn thr 

ACC GGA GAA CCC GAC 
thr gly glu pro asp 

TCG CCG GAC GGC AAC 
ser pro asp gly asn 

CTC GAC GAT GTG GTG 
leu asp asp val val 

GAC AAG ACC TGA 
asp lys thr\ OPA 



991/331 

CCC GGC GTG GCC GGG 
pro gly val ala gly 
1051/351 

AAA CTG ACG GTG GCG 
lys leu thr val ala 
1111/371 

GTT GTC CGC AAA GAC 
val val arg lys asp 
1171/391 

GTC TGG GGA GCC ACC 
val trp gly ala thr 
1231/411 

TTC CCG CTG TTC CCG 
phe pro leu phe pro 



TGT GCC GCG ATG GAC 
cys ala ala met asp 

GTC CGG CTC GCG CCG 
val arg leu ala pro 

ACT CAT GCG CAT GCG 
thr his ala his ala 

GTC AAC AAG ACC GCC 
val asn lys thr ala 

CAG GGT GGC GGC TTC 
gin gly gly gly phe 



SEQ ID f^cS: G£>2*-(e&lb 
FIGURE 39.E (continued) 

1/1 31/11 



GAA GGC 


CTT 


GTT 


GAG 


CCG 


GCG 


CAC 


GAA 


AAC 


GAT CGT 


TGT 


GTG 


TAC 


ATT 


GGT 


GTG 


TAT 


GGC 


glu gly 


leu 


val 


glu 


pro 


ala 


his 


glu 


asn 


asp arg 


cys 


val 


tyr 


ile 


gly 


val 


tyr 


gly 


61/21 


















91/31 


















TCG GTT 


GAA 


CGT 


GTA 


TGT 


GCC 


CGA 


CGA 


ATT 


GGC GGA 


GCG 


CGC 


CAG 


GGC 


GCG 


GGG 


CTT 


GAA 


ser va 1 


glu 


arg 


val 


cys 


ala 


arg 


arg 


ile 


gly gly 


ala 


arg 


gin 


gly 


ala 


gly 


leu 


glu 


121/41 


















151/51 


















CGT CTC 


GGC 


GCT 


GAC 


TCA 


GGC 


CGC 


GAT 


CAG 


TGC CGA 


GTT 


GGA 


GAA 


CTC 


CGC 


AAC 


CGA 


TGC 


arg leu 


gly 


ala 


asp 


ser 


gly 


arg 


asp 


gin 


cys arg 


val 


gly 


glu 


leu 


arg 


asn 


arg 


cys 


181/61 


















211/71 


















GTG GCT 


TGA 


GGG 


GTT 


GGA 


ACC 


CAG 


AAG 


CAC 


CGG CGC 


TCG 


GCA 


TGA 


TGA 


CGT 


GCT 


GGG 


TGC 


val al^ 


lOPA< 


fgly 


val 


gly 


thr 


gin 


lys 


his 


arg arg 


ser 


ala) 


► OPA 


(OPA 


arg 


ala 


gly 


cys 


241/81 








271/91 










GAT CGA 


TGC 


CGC 


TCG 


CGA 


TGA 


GTT 


CGA 


AGC 


GTG AGA 


GCA 


TCG 


CCC 


ACT 


TCG 


CCG 


CCG 


GAG 


asp arg 


cys 


arg 


ser 


arg" 


)OPA< 


(val 


arg 


ser 


val arg 


ala 


ser 


pro 


thr 


ser 


pro 


pro 


glu 


301/101 












331/111 


















CAG GTG 


GTC 


GTC 


GAC 


GCG 


AGT 


GCC 


ATG 


GTG 


GAT C 


















gin val 


val 


val 


asp 


ala 


ser 


ala 


met 


val 


asp) 



















seq id M 05 ". (o&\-(J&'> c \h\- ^ 33 

FIGURE 40A 



V- 128/185 



xn 31/11 



KJt AAG GCC 


TTG 


TTG 


AGC 


CGG 


CGC 


ACG 


AAA 


ACG 


ATC GTT 


GTG 


TGT 


ACA 


TTG 


GTG 


TGT 


ATG 


GCT 


J^riys ala 
u\ N 61/21 


leu 


leu 


ser 


arg 


arg 


thr 


lys 


thr 


ile val 


val 


cys 


thr 


leu 


val 


cys 


met 


ala 




































• CGG TTG 


AAC 


GTG 


TAT 


GTG 


CCC 


GAC 


GAA 


TTG 


GCG GAG 


CGC 


GCC 


AGG 


GCG 


CGG 


GGC 


TTG 


AAC 


arg leu 


asn 


val 


tyr 


val 


pro 


asp 


glu 


leu 


ala glu 


arg 


ala 


arg 


ala 


arg 


gly 


leu 


asn 


121/41 


















151/51 


















GTC TCG 


GCG 


CTG 


ACT 


CAG 


GCC 


GCG 


ATC 


AGT 


GCC GAG 


TTG 


GAG 


AAC 


TCC 


GCA 


ACC 


GAT 


GCG 


val ser 


ala 


leu 


thr 


gin 


ala 


ala 


ile 


ser 


ala glu 


leu 


glu 


asn 


ser 


ala 


thr 


asp 


ala 


181/61 


















211/71 


















TGG CTT 


GAG 


GGG 


TTG 


GAA 


CCC 


AGA 


AGC 


ACC 


GGC GCT 


CGG 


CAT 


GAT 


GAC 


GTG 


CTG 


GGT 


GCG 


trp leu 


glu 


gly 


leu 


glu 


pro 


arg 


ser 


thr 


gly ala 


arg 


his 


asp 


asp 


val 


leu 


giy 


ala 


241/81 
















271/91 


















ATC GAT 


GCC 


GCT 


CGC 


GAT 


GAG 


TTC 


GAA 


GCG 


TGA GAG 


CAT 


CGC 


CCA 


CTT 


CGC 


CGC 


CGG 


AGC 


ile asp 


ala 


ala 


arg 


asp 


glu 


phe 


glu 


ala] 


|OPA(glu 


his 


arg 


pro 


leu 


arg 


arg 


arg 


ser 


301/101 


















33lAir 












AGG TGG 


TCG 


TCG 


ACG 


CGA 


GTG 


CCA 


TGG 


TGG 


ATC 


















arg trp 


ser 


ser 


thr 


arg 


val 


pro 


trp 


trp 


ile) 



















SEQ ID KD5\ &&>(o- b>(&% 
FIGURE 40B 



$y i/i 31/n 

9\mAGG CCT TGT TGA GCC GGC GCA CGA AAA CGA TCG TTG TGT GTA CAT TGG TGT GTA TGG CTC 
^~arg pro cys OPA (ala gly ala arg lys arg ser leu cys val his trp cys val trp leu 
61/21 SPQ N*>'. 91/31 

GGT TGA ACG TGT ATG TGC CCG ACG AAT TGG CGG AGC GCG CCA GGG CGC GGG GCT TGA ACG 
gly)oPA(thr cys met cys pro thr asn trp arg ser ala pro gly arg gly ala)oPA(thr 
121/41 Vse6>Xt>MO: fell 151/51 <£QXt>NO%<*Tl*V-^ 

TCT CGG CGC TGA CTC AGG CCG CGA TCA GTG CCG AGT TGG AGA ACT CCG CAA CCG ATG CGT 
ser arg arg)oPA(leu arg pro arg ser val pro ser trp arg thr pro gin pro met arg 
181/61 3 211/71 

GGC TTG AGG GGT TGG AAC CCA GAA GCA CCG GCG CTC GGC ATG ATG ACG TGC TGG GTG CGA 
gly leu arg gly trp asn pro glu ala pro ala leu gly met met thr cys trp val arg 
241/81 271/91 

TCG ATG CCG CTC GCG ATG AGT TCG AAG CGT GAG AGC ATC GCC CAC TTC GCC GCC GGA GCA 
ser met pro leu ala met ser ser lys arg glu ser ile ala his phe ala ala gly ala 
301/101 

GGT GGT CGT CGA CGC GAG TGC CAT GGT GGA TC 
gly gly arg arg arg glu cys his gly gly) 



SEQ ID HC&'.C^^-IdI^ 



FIGURE 40C 



129/185 




/^Coding sequence Rv0549c predicted by Cole et al.> 1998 (Nature -393 : 537-544 ) and 
^i 9 containing seq40A 

^ 1/1 31/11 

igtg aga gca teg ccc act teg ccg ccg gag cag gtg gtc gtc gac gcg agt gec atg gtg 

/(val arg ala ser pro thr ser pro pro glu gin val val val asp ala ser ala met val 
61/21 91/31 
,fy gat eta ctg get cgc act age gat egg tgc tct gcg gtg cgc gcg egg ctg get egg ace 
y asp leu leu ala arg thr ser asp arg cys ser ala val arg ala arg leu ala arg thr 
121/41 151/51 
gcg atg cac gcg ccg gcg cac ttc gat gca gag gtg ttg teg gcg ctg ggg cgc atg cag 
ala met his ala pro ala his phe asp ala glu val leu ser ala leu gly arg met gin 
181/61 211/71 

cgc gee ggc gca etc ace gtt gee tat gtc gat gcg gca ctg gag gag ttg cga cag gtg 
arg ala gly ala leu thr val ala tyr val asp ala ala leu glu glu leu arg gin val 
241/81 271/91 

ccg gtg act cga cac ggt ctt teg teg ctg ctt get gga. gcg tgg teg cgc cgc gac acc 
pro val thr arg his gly leu ser ser leu leu ala gly ala trp ser arg arg asp thr 
301/101 331/111 

etc cgc ctg acc gat gee etc tac gtc gag ctg gec gaa acg gca ggt ctg gtg ttg ttg 
leu arg leu thr asp ala leu tyr val glu leu ala glu thr ala gly leu val leu leu 
361/121 391/131 

acc acc gac gaa aga ttg gca cgc gee tgg ccc teg get cac gee ate ggc tga - - 
thr thr asp glu arg leu ala arg ala trp pro ser ala his ala ile gly^OPA 

SEQ ID fslps\ 6?74~ kis 

FIGURE 40D 

Jo 

^ ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv0549c 
1/1 31/11 

|tga gtt cga age gtg aga gca teg ccc act teg ccg ccg gag cag gtg gtc gtc gac gcg 
OPAlyal arg ser val arg ala ser pro thr ser pro pro glu gin val val val asp ala 

eim^seaxfc Kio'.kTl 91/31 

agt gee atg gtg gat eta ctg get cgc act age gat egg tgc tct gcg gtg cgc gcg egg 

ser ala met val asp leu leu ala arg thr ser asp arg cys ser ala val arg ala arg 
121/41 151/51 

ctg get egg acc gcg atg cac gcg ccg gcg cac ttc gat gca gag gtg ttg teg gcg ctg 

leu ala arg thr ala met his ala pro ala his phe asp ala glu val leu ser ala leu 
181/61 211/71 

ggg cgc atg cag cgc gec ggc gca etc acc gtt gee tat gtc gat gcg gca ctg gag gag 

gly arg met gin arg ala gly ala leu thr val ala tyr val asp ala ala leu glu glu 
241/81 271/91 

ttg cga cag gtg ccg gtg act cga cac ggt ctt teg teg ctg ctt get gga gcg tgg teg 

leu arg gin val pro val thr arg his gly leu ser ser leu leu ala gly ala trp ser 
30T/101 ~ -■- - ~ 331/111 



cgc cgc gac acc etc cgc ctg acc gat gee etc tac gtc gag ctg gec gaa acg gca ggt 

arg arg asp thr leu arg leu thr asp ala leu tyr val glu leu ala glu thr ala gly 

361/121 391/131 

ctg gtg ttg ttg acc acc gac gaa aga ttg gca cgc gec tgg ccc teg get cac gee 

leu val leu leu thr thr asp glu arg leu ala arg ala trp pro ser. ala his ala 



ate 
ile 



421/141 
ggc tga 
glyJOPA 

seq id NOV. 77 



FIGURE 40E 



130/185 



<=> in 31/11 

SfCCT GGC CGG GAC GCC TAC GTG TAG CCC GCG GCT AGC ACA GGA TAG CCA TTG TTG TGC GGT 

( pro giy arg asp ala tyr val)AMB(pro ala ala ser thr gly)AMB(pro leu leu cys gly 
$ 61/21 ^©irbXtxMD- 7 91 / 31 v seorDND:^( 

AGC GCC AAA ACG ATC AGC CCT TCG CGG ACA TGT CAG CAC CCG CCT TGG CCG GGA GAG CGG 
ser ala lys thr ile ser pro ser arg thr cys gin his pro pro trp pro gly glu arg 
121/41 151/51 

CGT CGT GAC CGT GCT GTC ACC ACG TCT GGT TAG GCT CGG GGC GCG GGC TGG CGC GGA GGA 
arq arg asp arg ala val thr thr ser gly)AMB(ala arg gly ala gly trp arg gly gly 
181/61 211/71 / ^S*C?Xb*D: 

GGT GTG TTG CGG AGG AGG TGT GTT GTA GTG GGG ACG GCG GAT CGG CCG TTG GAC GCC TCG 
gly val leu arg arg arg cys val val val gly thr ala asp arg pro leu asp ala ser 
241/81 271/91 
GCC TTG CGG GAC TGG GCA CAC GCC GTC GTC AGC GAT C 
ala leu arg asp trp ala his ala val val ser asp) 



SEQ ID MPs: &7g- 
FIGURE 41A 



#1/1 31/11 

V CTG GCC GGG ACG CCT ACG TGT AGC CCG CGG CTA GCA CAG GAT AGC CAT TGT TGT GCG GTA 
(leu ala gly thr pro thr cys ser pro arg leu ala gin asp ser his cys cys ala val 
T 61/21 91/31 
^ GCG CCA AAA CGA TCA GCC CTT CGC GGA CAT GTC AGC ACC CGC CTT GGC CGG GAG AGC GGC 
9 ala pro lys arg ser ala leu arg gly his val ser thr arg leu gly arg glu ser gly 
121/41 151/51 

GTC GTG ACC GTG CTG TCA CCA CGT CTG GTT AGG CTC GGG GCG CGG GCT GGC GCG GAG GAG 
val val thr val leu ser pro arg leu val arg leu gly ala arg ala gly ala glu glu 
181/61 211/71 

GTG TGT TGC GGA GGA GGT GTG TTG- TAG TGG GGA CGG CGG ATC GGC CGT TGG ACG CCT CGG 
val cys cys gly gly gly val leu)AMB(trp gly arg arg ile gly arg trp thr pro arg 

241/81 5GQ3bNO\WS>_ 271/91 

CCT TGC GGG ACT GGG CAC ACG CCG TCG TCA GCG ATC 
pro cys gly thr gly his thr pro ser ser ala ile^ 



SEQ ID A/CS; 6,^3- 6^5 



FIGURE 4 IB 



131/185 



.S3 

/ ( 
J? 



1 /I 

1/1 


















31/11 




















GGA 


CGC 


CTA 


CGT 


GTA 


GCC 


CGC 


GGC 


TAG CAC 


AGG 


ATA 


GCC 


ATT 


GTT 


GTG 


CGG 


TAG 


trp pro 


gly 


arg 


leu 


arg 


val 


ala 


arg 


giy) 


AMBfhis 


arg 


ile 


ala 


ile 


val 


val 


arg) 


IAMB 


61/21 
















91/31 fx> /V£>: 6S9 










CGC CAA 


AAC 


GAT 


CAG 


CCC 


TTC 


GCG 


GAC 


ATG 


TCA GCA 


CCC 


GCC 


TTG 


GCC 


GGG 


AGA 


GCG 


GCG 


arg gin 


asn 


asp 


gin 


pro 


phe 


ala 


asp 


met 


ser ala 


pro 


ala 


leu 


ala 


gly 


arg 


ala 


ala 


121/41 


















151/51 


















TCG TGA 


CCG 


TGC 


TGT 


CAC 


CAC 


GTC 


TGG 


TTA 


GGC TCG 


GGG 


CGC 


GGG 


CTG 


GCG 


CGG 


AGG 


AGG 


ser)oPA| 


(pro 


cys 


cys 


his 


his 


val 


trp 


leu 


gly ser 


gly 


arg 


gly 


leu 


ala 


arg 


arg 


arg 


181/61 












211/71 


















TGT GTT 


GCG 


GAG 


GAG 


GTG 


TGT 


TGT 


AGT 


GGG 


GAC GGC 


GGA 


TCG 


GCC 


GTT 


GGA 


CGC 


CTC 


GGC 


cys val 


ala 


glu 


glu 


val 


cys 


cys 


ser 


gly 


asp gly 


gly 


ser 


ala 


val 


gly 


arg 


leu 


gly 


241/81 


















271/91 


















CTT GCG 


GGA 


CTG 


GGC 


ACA 


CGC 


CGT 


CGT 


CAG 


CGA TC 


















leu ala 


gly 


leu 


gly 


thr 


arg 


arg 


arg 


gin 


arg) 



















SEQ ID 1SJD5*. tcKfa-fcrt® 
FIGURE 41C 



a- 

Coding sequence Rv2975c predicted by Cole et al, 1998 (Nature 393: 537-544) and 
^ containing seq41A 



r 



1/1 


















31/11 


















gtg ggg 


acg 


gcg 


gat 


egg 


ccg 


ttg 


gac 


gec 


teg gee 


ttg 


egg 


gac 


tgg 


gca 


cac 


gee 


gtc 


val gly 


thr 


ala 


asp 


arg 


pro 


leu 


asp 


ala 


ser ala 


leu 


arg 


asp 


trp 


ala 


his 


ala 


val 


'61/21 


















91/31 


















gtc age 


gat 


ctg 


ate 


etc 


cac 


ate 


gac 


gag 


ate aac 


egg 


etc 


aat 


gtg 


ttc 


ccg 


gtc 


get 


val ser 


asp 


leu 


ile 


leu 


his 


ile 


asp 


glu 


ile asn 


arg 


leu 


asn 


val 


phe 


pro 


val 


ala 


121/41 


















151/51 


















gac tec 


gat 


acc 


ggc 


gtc 


aac 


atg 


ctg 


ttc 


acc atg 


cgt 


gec 


gcg 


gtc 


gta 


gaa 


get 


gat 


asp ser 


asp 


thr 


gly 


val 


asn 


met 


leu 


phe 


thr met 


arg 


ala 


ala 


val 


val 


glu 


ala 


asp 


181/61 


















211/71 


















ttg cac 


gcg 


aat 


teg 


cag 


get 


gac 


gee 


gaa 


gac gtg 


gcg 


egg 


gtt 


gcg 


gec 


get 


etc 


gcg 


leu his 


ala 


asn 


ser 


gin 


ala 


asp 


ala 


glu 


asp val 


ala 


arg 


val 


ala 


ala 


ala 


leu 


ala 


241/81 




































gec ggc 


gcg 


cgt 


tga 






























ala gly 


ala 


arg] 


|OPA 































seq id N05;&9i-kA3- 



FIGURE 41D 



132/185 



ORF according to Cole et al, 1998 (Nature 393: 537-544) and containing Rv2975c 



1/1 31/11 



tag get 


egg 


ggc 


gcg 


ggc 


tgg 


cgc 


gga 


gga 


ggt gtg 


ttg 


egg 


agg 


agg 


tgt 


gtt 


gta 


gtg 


•AMB (a la 


arg 


gly 


ala 


gly 


trp 


arg 


gly 


gly 


gly val 


leu 


arg 


arg 


arg 


cys 


val 


val 


val 


61/21^£^GlX-E>NO: fe*?** 












91/31 


















ggg acg 


gcg 


gat 


egg 


ccg 


ttg 


gac 


gec 


teg 


gec ttg 


egg 


gac 


tgg 


gca 


cac 


gec 


gtc 


gtc 


gly thr 


ala 


asp 


arg 


pro 


leu 


asp 


ala 


ser 


ala leu 


arg 


asp 


trp 


ala 


his 


ala 


val 


val 


121/41 


















151/51 


















age gat 


ctg 


ate 


etc 


cac 


ate 


gac 


gag 


ate 


aac egg 


etc 


aat 


gtg 


ttc 


ccg 


gtc 


get 


gac 


ser asp 


leu 


ile 


leu 


his 


ile 


asp 


glu 


ile 


asn arg 


leu 


asn 


val 


phe 


pro 


val 


ala 


asp 


181/61 


















211/71 


















tec gat 


acc 


ggc 


gtc 


aac 


atg 


ctg 


ttc 


acc 


atg cgt 


gec 


gcg 


gtc 


gta 


gaa 


get 


gat 


ttg 


ser asp 


thr 


gly 


val 


asn 


met 


leu 


phe 


thr 


met arg 


ala 


ala 


val 


val 


glu 


ala 


asp 


leu 


241/81 
















271/91 


















cac gcg 


aat 


teg 


cag 


get 


gac 


gec 


gaa 


gac 


gtg gcg 


egg 


gtt 


gcg 


gee 


get 


etc 


gcg 


gec 


his ala 


asn 


ser 


gin 


ala 


asp 


ala 


glu 


asp 


val ala 


arg 


val 


ala 


ala 


ala 


leu 


ala 


ala 



301/101 

ggc gcg cgt tga 
gly ala arg)OPA 



SEQ ID NOS>- fc^^-loSH 
FIGURE 41E 



sequence Rv 2974C predicted by Cole et al . (Nature 393:537-544) and which may be 
in the same reading frame as Seq41D. The sequencing of this region reveals, in 
one case out of three, a deletion of two nucleotides 

?>1/1 31/11 

ttg aac gga get cgc ggc aac tec ggc gtg ate ctg tec cag ate ctg cgc ggg ate gca 
"^leu asn gly ala arg gly asn ser gly val ile leu ser gin ile leu arg gly ile ala 
^61/21 91/31 
gag gtg acc gcg act gcg gee gee gee tct ggc gcg gta ttg egg gcg gtc gac gee aac 
glu val thr ala thr ala ala ala ala ser gly ala val leu arg ala val asp ala asn 
121/41 151/51 

gee etc ggg gee gcg ttg tgg cgc ggc gtc gag ttg gtc gtc gcg teg atg ggt ggc gtg 
ala leu gly ala ala leu trp arg gly val glu leu val val ala ser met gly gly val 
181/61 211/71 

gag gtg ccg gga act ate gtc teg gtg ctg egg gec gec gec gga gec gtc gac cag tgc 
glu val pro gly thr ile val ser val leu arg ala ala ala gly ala val asp gin cys 
241/81 271/91 
■ —gcg- -cac gag~ggg~ttg-gcc ggt~geg~ gtc- acc gee- gec ggt gac gcg gcg^.g,tc„ate .gcg ctg 
ala his glu gly leu ala gly ala val thr ala ala gly asp ala ala val ile ala leu 
301/101 331/111 

gaa aag acc ccc gaa cag ctt gac gtg etc gee gat gcg ggc gcg gtg gac gee ggc gga 
glu lys thr pro glu gin leu asp val leu ala asp ala gly ala val asp ala gly gly 



seq id ^C6;695~<b96 

FIGURE 4 IF 



133/185 



361/121 

egg ggc ctg ctg gtt ctg ctg gac gcg ttg 
arg gly leu leu val leu leu asp ala leu 
421/141 

egg gcg gtc tac gaa ccc teg ccg cgc gcg 
arg ala val tyr glu pro ser pro arg ala 
481/161 

gec ccg caa ttc gag gtg atg tat ctg ttg 
ala pro gin phe glu val met tyr leu leu 
541/181 

ttg egg gat cga etc aag gaa ttg ggt gag 
leu arg asp arg leu lys glu leu gly glu 
601/201 

age tac tec gta cac gtc cac acc gac gac 
ser tyr ser val his val his thr asp asp 
661/221 

gtg ggg cga gtt age egg ate gtg ate teg 
val gly arg val ser arg ile val ile ser 
721/241 

gee ggt ggc tgg acg egg ggc cgc gee gtg 
ala gly gly trp thr arg gly arg ala val 
781/261 

gag ctg ttc gee ggg gag ggc gee tgc gtg 
glu leu phe ala gly glu gly ala cys val 
841/281 

gee gee gat ate agt gec cac cag ctg gtg 
ala ala asp ile ser ala his gin leu val 
901/301 

gtg atg gtg ctg ccc aat ggc tat gtg gec 
val met val leu pro asn gly tyr val ala 
961/321 

gcg ate ggc tgg ggc gtc gac gtg gta ccc 
ala ile gly trp gly val asp val val pro 
1021/341 

gee gcg ctg gec gtg cat gac gcg gee cgc 
ala ala leu ala val his asp ala ala arg 
1081/361 

cgt gec gec ggt get tec egg cac gga teg 
arg ala ala gly ala ser arg his gly ser 
1141/381 

tgg gee ggt acc tgc aag ccg ggc gac ggt 
trp ala gly thr cys lys pro gly asp gly 
1201/401 

gtc gee gac gat gtc gec gcg gcg gee ate 
val ala asp asp val ala ala ala ala ile 
1261/421 

ggc gat ctg gtg acg gtg eta att ggc gee 
gly asp leu val thr val leu ile gly ala 

132 1/441 ___ 

gaa egg cat gtg cac gac cac cat cca ggc 
glu arg his val his asp his his pro gly 
1381/461 

cgc ggc gac gcg ctg ctg ate ggg gtc gag 
arg gly asp ala leu leu ile gly val glu 1 



391/131 

cgc tec acc ate tgc ggg cag gca cct gec 
arg ser thr ile cys gly gin ala pro ala 
451/151 

ttg ccg acc gac acg get acc caa cgc ccc 
leu pro thr asp thr ala thr gin arg pro 
511/171 

gcg gta tgt gat get gca gcg gcg gac cag 
ala val cys asp ala ala ala ala asp gin 
571/191 

teg gtg gec ate gee get get, ccg ccc gac 
ser val ala ile ala ala ala pro pro asp 
631/211 

gee ggt gee gec gtg gaa gee gga ttg gcg 
ala gly ala ala val glu ala gly leu ala 
691/231 

gcg etc ggt tec ggg acc age gga ttg ccg 
ala leu gly ser gly thr ser gly leu pro 
751/251 

ctg gcg gtc gtc gac ggc gac ggt gee gec 
leu ala val val asp gly asp gly ala ala 
811/271 

ctg cga ccg ggt cca gac gee gtg aca ccg 
leu arg pro gly pro asp ala val thr pro 
871/291 

egg gec gtg gta gac acc . .ggc gee..,, gcg . cac 
arg ala val val asp thr gly ala ala his 
931/311 

gec gaa gaa ctg gtg gec ggg tgt acc gcg 
ala glu glu leu val ala gly cys thr ala 
991/331 

gtg ccg acc gga teg atg gtg cag ggg ttg 
val pro thr gly ser met val gin gly leu 
1051/351 

cag gee gtc gac gac ggc tac age atg gec 
gin ala val asp asp gly tyr ser met ala 
1111/371 

gtg cgc att gee acc caa aag gcg ctg acc 
val arg ile ala thr gin lys ala leu thr 
1171/391 

ctg ggt ate gcg ggc gac gag gtg ctg ate 
leu gly ile ala gly asp glu val leu ile 
1231/411 

ggt ctg gtc gac ctg ttg ttg gca teg gga 
gly leu val asp leu leu leu ala ser gly 
1291/431 

ggc gta acc gaa gac gtg get gtc gtc ctg 
gly val thr glu asp val ala val val leu 

1351/451 

acc gag ctg gtc" tec tac "cgc acc gga cac 
thr glu leu val ser tyr arg thr gly his 
1411/471 
tag 
AMB 
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Seq41T comprising seq 41F and seq 41S 

1/1 ^ft****"" 1 * 9, S6«» wt> - in 31/11 
:flXt>W£^n tta ggc teg ggg cgc ggg ctg gcg egg agg agg tgt gtt gcg gag gag gtg tgt tgt agt 
"^ffcUtftoW leu gly ser gly arg gly leu ala arg arg arg cys val ala glu glu val cys cys ser 
* AMHiala arg gly ala gly trp arg gly gly gly val leu arg arg arg cys val val val 

y* noV*"^nirg leu gly ala arg ala gly ala glu glu val cys cys gly gly gly val leu)AMB(trp 
A** 1 ** ^^6f/21 91/31 ^STOXfcN*- 

t^*^ ggg gac ggc gga teg gec gtt gga cgc etc ggc ctt gcg gga ctg ggc aca cgc cgt cgt 
s£P gly asp gly gly ser ala val gly arg leu gly leu ala gly leu gly thr arg arg arg 

. gly thr ala asp arg pro leu asp ala ser ala leu arg asp trp ala his ala val val 
aly arg arg ile gly arg trp thr pro arg pro cys gly thr gly his thr pro ser 'ser 
121/41 151/51 

cag cga tct gat cct cca cat cga cga gat caa ccg get caa tgt gtt ccc ggt cgc tga 
gin arg ser asp pro pro his arg arg asp gin pro ala gin cys val pro gly arg)OPA 
ser asp leu ile leu his ile asp glu ile asn arg leu asn val phe pro val ala asp 
ala ile^OPA/ser ser thr ser thr arg ser thr gly ser met cys ser arg ser leu thr 
181/61 ^^«fQat>M0MO£> 211/71 

etc cga tac egg cgt caa cat get gtt cac cat gcg tgc cgc ggt cgt aga age tga ttt vV iT>* n 
itf ^u(leu arg tyr arg arg gin his ala val his his ala cys arg gly arg arg ser^OPAfehe ^ Jl> K 
r$>* ^ ser asp thr gly val asn met leu phe thr met arg ala ala val val glu ala asp leu 

^ Dro He pro ala ser thr cys cys ser pro cys val pro arg ser)AMB(lys leu ile cys 

241/81 271/91 V ^SBQZbM&.liy 

gca cgc gaa ttc gca ggc tga cgc cga aga cgt ggc gcg ggt tgc ggc cgc tct cgc ggc 
ala arg glu phe ala gl _) OP^arg arg arg arg gly ala gly cys gly arg ser arg gly 
his ala asn ser gin ala (asp ala glu asp val ala arg val ala ala ala leu ala ala 
thr arg ile arg arg leujthr pro lys thr trp arg gly leu arg pro leu ser arg pro 

JW .301/101 S6&3S>*t&10\f , . „, 331/111 , , H 

egg cgc gcg ttg aac gga get cgc ggc aac tec ggc gtg ate ctg tec cag ate ctg cgc 
arg arg ala leu asn gly ala arg gly asn ser gly val ile leu ser gin ile leu arg 
gly ala arg) OP3,(thr glu leu ala ala thr pro ala)OPAfeer cys pro arg ser cys ala 
ala ara val (qlu arg ser ser arg gin leu arg arg asp7pro val pro asp pro ala arg 
361/121 V^Art>wb'.-tt>S 391/131 > 705 

ggg ate gca gag gtg ace gcg act gcg gee gee gee tct ggc gcg gta ttg egg gcg gtc 
gly ile ala glu val thr ala thr ala ala ala ala ser gly ala val leu arg ala val 
gly ser gin arg^)OPA(pro arg leu arg pro pro pro leu ala arg tyr cys gly arg ser 
asp arg arg gly asp/arg asp cys gly arg arg leu trp arg gly ile ala gly gly arg 

421/141 c^edabMOiiou 451/151 

gac gee aac gec etc ggg gec gcg ttg tgg cgc ggc gtc gag ttg gtc gtc gcg teg atg 
asp ala asn ala leu gly ala ala leu trp arg gly val glu leu val val ala ser met 
thr pro thr pro ser gly pro arg cys gly ala ala ser ser trp ser ser arg arg trp 
arg gin arg pro arg gly arg val val ala arg arg arg val gly arg arg val asp gly 
481/161 511/171 

ggt ggc gtg gag gtg ccg gga act ate gtc teg gtg ctg egg gec gee gec gga gee gtc 
gly gly val glu val pro gly thr ile val ser val leu arg ala ala ala gly ala val 
val ala trp arg cys arg glu leu ser ser arg cys cys gly pro pro pro glu pro ser 
trp arg gly gly ala gly asn tyr arg leu gly ala ala gly arg arg arg ser arg arg 
541/181 S71/191 

gac cag tgc gcg cac gag ggg ttg gee ggt gcg gtc ace gee gec ggt gac gcg gcg gtc 
asp gin cys ala his glu gly leu ala gly ala val thr ala ala gly asp ala ala val 
thr ser ala arg thr arg gly trp pro val arg ser pro pro pro val thr arg arg ser 
pro val arg ala arg gly val gly arg cys gly his arg arg arg) OPA ( a ^ly gly his 

601/201 631/211 

a-tc-geg. ctg gaa _aag_acc„cce„gaa„ cag__ct _t .jgacjgtg .etc _gcc gat gc5r._g.gc gcg gtg gac 

ile ala leu glu lys thr pro glu gin leu asp val leu ala asp ala gly ala val asp 
ser arg trp lys arg pro pro asn ser leu thr cys- ser pro met arg ala arg trp thr 
ara ala gly lys asp pro arg thr ala^ OPA(aro_ala arg arg cys gly arg gly gly arg 
66 1/I 2 1 J 691/f3_N^QI-C>N^; r Taa 



gec ggc gga egg ggc ctg ctg gtt ctg ctg gac gcg ttg cgc tec acc ate tgc ggg cag 
ala gly gly arg gly leu leu val leu leu asp ala leu arg ser thr ile cys gly gin 
pro ala asp gly ala cys trp phe cys trp thr arg cys ala pro pro ser ala gly arg 
arg arg thr gly pro ala gly ser ala gly arg val ala leu his his 1 u arg ala gly 
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721/241 751/251 

gca cct gcc egg gcg gtc tac gaa ccc teg ccg cgc gcg ttg ccg acc gac acg get acc 
ala pro ala arg ala val tyr glu pro ser pro arg ala leu pro thr asp thr ala thr 
his leu pro gly arg ser thr asn pro arg arg ala arg cys arg pro thr arg leu pro 
thr cys pro gly gly leu arg thr leu ala ala arg val ala asp arg his gly tyr pro 
781/261 811/271 

caa cgc ccc gcc ccg caa ttc gag gtg atg tat ctg ttg gcg gta tgt gat get gca gcg 
gin arg pro ala pro gin phe glu val met tyr leu leu ala val cys asp ala ala ala 
asn ala pro pro arg asn ser arg) OPA \gys ile cys trp arg tyr val met leu gin arg 
thr pro arg pro ala ile arg gly asp^val ser val gly gly met)oPA cys cys ser gly 
841/281 ScTQtnbKlOllOl— ^ 871/291 

gcg gac cag ttg egg gat cga etc aag gaa ttg ggt gag teg gtg gcc ate gcc get get 
ala asp gin leu arg asp arg leu lys glu leu gly glu ser val ala ile ala ala ala 
arg thr ser cys gly ile asp ser arg asn trp val ser arg trp pro ser pro leu leu 
gly pro val ala gly ser thr gin gly ile gly)oPA£ral gly gly his arg arg cys ser 
901/301 931/311 VstfGrrt>MO\ 

ccg ccc gac age tac tec gta cac gtc cac acc gac gac gcc ggt gcc gcc gtg gaa gcc 
pro pro asp ser tyr ser val his val his thr asp asp ala gly ala ala val glu ala 
arg pro thr ala thr pro tyr thr ser thr pro thr thr pro val pro pro trp lys pro 
ala arg gin leu leu arg thr arg pro. his arg arg arg arg cys arg arg gly ser arg 
961/321 991/331 

gga ttg gcg gtg ggg cga gtt age egg ate gtg ate teg gcg etc ggt tec ggg acc age 
gly leu ala val gly arg val ser arg ile val ile ser ala leu gly ser gly thr ser 
asp trp arg trp gly glu leu ala gly ser^OPA^er arg arg ser val pro gly pro ala - 
ile gly gly gly ala ser)AMB (pro asp arg asp leuNgly ala arg phe arg asp gin arg 

1021/341 setkz&KvniL* — ^ 1051/351 h sbq-x^ho^oz 

gga ttg ccg gcc ggt ggc tgg acg ego; ggc cgc gcc gtg ctg gcg gtc gtc gac ggc gac 
gly leu pro ala gly gly trp thr arg gly arg ala val leu ala val val asp. gly asp 
asp cys arg pro val ala gly arg gly ala ala pro cys trp arg ser ser thr ala thr 
ile ala gly arg trp leu asp ala gly pro arg arg ala gly gly arg arg arg arg arg 
1081/361 1111/371 

ggt gcc gcc gag ctg ttc gcc ggg gag ggc gcc tgc gtg ctg cga ccg ggt cca gac gcc 
gly ala ala glu leu phe ala gly glu gly ala cys val leu arg pro gly pro asp ala 
val pro pro ser cys ser pro gly arg ala pro ala cys cys asp arg val gin thr pro^ 
cys arg arg ala val arg arg gly gly arg leu arg ala ala thr gly ser arg arg arg 
1141/381 1171/391 

gtg aca ccg gcc gcc gat ate agt gcc cac cag ctg gtg egg gcc gtg gta gac acc ggc 
val thr pro ala ala asp ile ser ala his gin leu val arg ala val val asp (thr gly 
OPAjfhis arg pro pro ile ser val pro thr ser trp cys gly pro trp^AMB thrtoro ala 
l^^asp t^r gly arg arg tyr gin cys pro pro ala gly ala gly arg gly arg hid arg arg 
1201/401 1231/411 t0SS.&±*>\'\\x>*S 

gcc gcg cac gtg atg gtg ctg ccc aat ggc tat gtg gcc gcc gaa gaa ctg gtg gcc ggg 
ala ala his val met val leu pro asn gly tyr val ala ala glu glu leu val ala gly 
pro arg thr\oPA(t^rp cys cys pro met ala met trp pro pro lys asn trp trp pro gly 
arg ala arg asp%ly ala ala gin trp leu cys gly arg arg arg thr gly gly arg val 
1261/421 C5£M:C>rtD\-m 1291/431 

tgt acc gcg gcg ate ggc tgg ggc gtc gac gtg gta ccc gtg ccg acc gga teg atg gtg 
cys thr ala ala ile gly trp gly val asp val val pro val pro thr gly ser met val 
val pro arg arg ser ala gly ala ser thr trp tyr pro cys arg pro asp arg trp cys 
tyr arg gly asp arg leu gly arg arg arg gly thr arg ala asp arg ile asp gly ala 
1321/441 1351/451 

cag ggg ttg gcc gcg ctg gcc gtg cat gac gcg gcc cgc cag gcc gtc gac gac ggc tac 
gln-gly leu_ala„ala_leu_ala _yal_hi,s„as P ala ala arg gin ala val asp asp gly tyr 
arg gly trp pro arg trp pro cys met thr arg pro ala arg pro ser thr thr ala thr 
gly val gly arg ala gly arg alcTjOPAQjrg gly pro pro gly arg arg arg arg leu gin 
1381/461 <;gPQrt^V4P'. 7^ 1 ^ 1411/471 

age atg gcc cgt gcc gcc ggt get tec egg cac gga teg gtg cgc att gcc acc caa aag 
ser met ala arg ala ala gly ala ser arg his gly ser val arg ile ala thr gin lys 
ala trp pro val pro pro val leu pro gly thr asp arg cys ala leu pro pro lys arg 
his gly pro cys arg arg cys phe pro ala arg ile gly ala his cys his pro lys gly 
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1441/481 1471/491 

aca eta acc tgg gee ggt acc tgc aag ccg ggc gac ggt ctg ggt ate gcg ggc gac gag 
111 ill thr trp ala gly thr cys lys pro gly asp gly leu gly ile ala gly asp glu 
araVPA?pro gly pro val pro ala ser arg ala thr val trp val ser arg ala thr arg 
afl asp^eu gly arg tyr leu gin ala gly arg arg ser gly tyr arg gly arg arg gly 

all ctg ate gtc gee gae gat gtc gec gcg gcg gec ate ggt ctg gtc gac ctg ttg ttg 
111 HI ile Sal ala Lp asp val ala ala ala ala ile gly leu val asp leu leu leu 
eysWfser ser pro thr met ser pro arg arg pro ser val trp ser thr cys cys trp 
ali aspVarg arg arg arg eys arg arg S lv 5 |^ 5 £ s arg ser gly arg pr ° Val ^ gly 

aca 1 icg\ga & g?c E ?at°ct| , gtg aeg gtg eta att'ggc'gcc ggc gta acc gaa gac gtg get 
Ila ser lit Hy «p leu val thr val leu ile gly ala gly val thr glu asp val ala 
hL arg glu ala ile trp>PA arg eys O C H (leu ala pro ala)oc H (pro lys thr trp leu 
ile gly arg arg ser gli asp gly ala/Ss^rp arg arg arg asn£rg ar ? ^ cys 

1621/541 saax&Nonvw/ i65i/5si >seo>xt»io. 7 is 

etc gtc ctg gaa egg cat ?tg cle gac eac cat cca ggc acc gag ctg gtc tec tac cgc 
val ™1 111 glu arg his vai his asp his his pro gly thr glu leu val ser tyr arg 
ser ser trp asn gly met cys thr thr thr ile gin ala pro ser trp ser pro thr ala 
arg P ro g^y thr ala cys ala arg pro pro ser arg his arg ala gly leu leu pro his 
1681/561 1711/571 
acc gga cac cgc ggc gac gcg ctg ctg ate ggg gtc gag.tag 
thr gly his arg gly asp ala leu leu ile gly val glujAMB 
pro asp thr ala ala thr arg cys)oPA(ser gly ser; ser 
arg thr pro arg arg arg ala ala asp.arg gly arg val 

*" iCQXt> /VO'.llfe 



5^Dt:LbM»:<^7--fc7<continue<i 2) 
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^lX>^° ,;la * GCC GGT AAC GCC GCG TCC CAG TGC TAT CCG TC^GCC GGA CCG CCC GAA ACA TCA GCG GCG 
g!y -n ala ala ser gin cys tyr pro ser^la gly pro pro glu thr ser ala ala 

61/21 CGG GCT CGA cVgCT CCA CCT GGC CAT CAG CGA CCA GGT 

g ?Y aS pro til gly arg gly arg ala arg pro^.l. pro pro gly his gin arg pro gly 

TAt'cGA GGT GGA AGC GGA CGG TGT TGG GAT GCA CGC CCA ACT TGC CGG CGA TCG CGG CGA 
tyr arg gly gly ser gly arg cys trp asp ^arg pro thr cys arg arg ser arg arg 

TGc'tCA TCG GAA CCC GCG ACG CAC ACA ATG CCc'gCA GCA CCG CAC GAC GGC GCC CCA CCG 
cys ITr ser glu pro ala thr his thr met pro ala ala pro his asp gly ala pro pro 

241/81 GCA GTG ACC. TGA TGA TGA,CAC TCA CCc'cCA TAA GGC TCG TCG GCT GCG CCT GJfl 

txi>No:-7-"a(- 

GAG GTG GGG TCT ATG 
glu val gly ser met 
361/121 #>u»* > -Ht , " ,- ' A 
AAC AAA CGT GGC AAT GCC GGG CAG CCT CTG CCC TTG TCG GAT C 



CCT CTT GCA GTG ACC TGA TGA TGA CAC TCA CCC CCA TAA GGC TW, iw, 
S Tel lit thr>OPA CPA OPA^s ser pro pro>OCH ser ser a a ala pro 

-cITtGC AGT^AG^TT^S^gIc^ tS'^A^CT G^OTG^GG TCT ATG GCC 
Sn cys sir ^s phe thr gin thr asp leu) OCH fas pro ala glu val gly ser met ala 

361/121 agi/iaiNseax&HoriBx 

AAC AAA CGT GGC AAT GCC GGG CAG CCT CTG CCC TTG TCG GAT C 
asn lys arg gly asn ala gly gin pro leu pro leu ser asp) 
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I"* 

7 1/1 31/11 
V,; CCG GTA ACG CCG CGT CCC AGT GCT ATC CGf CCG CCG GAC CGC CCG AAA CAT CAG CGG CGG 
pro val thr pro arg pro ser ala ile arg pro pro asp arg pro lys his gin arg arg 
61/2T""* 5e&*Of40; 1-^4 91/31 

GCG CCC CGG TCG GCC GCG GCC GGG CTC GAC CCG CTC CAC CTG GCC ATC AGC GAC CAG GTT 
ala pro arg ser ala ala ala gly leu asp pro leu his leu ala ile ser asp gin val 
121/41 151/51 

ATC GAG GTG GAA GCG GAC GGT GTT GGG ATG CAC GCC CAA CTT GCC GGC GAT CGC GGC GAT 
ile glu val glu ala asp gly val gly met his ala gin leu ala gly asp arg gly asp 
181/61 211/71 

GCT CAT CGG AAC CCG CGA CGC ACA CAA TGC CCG CAG CAC CGC ACG ACG GCG CCC CAC CGG 
ala his arg asn pro arg arg thr gin cys pro gin his arg thr thr ala pro his arg 
241/81 271/91 

CTC TTG CAG TGA CCT GAT GAT GAC ACT CAC CCC CAT AAG GCT CGT CGG CTG CGC CTG AGC 
leu leu gln)oPA (pro asp asp asp thr his pro his lys ala arg arg leu arg leu ser 
301/101 ^ SGEttb 331/111 

AAT GCA GTA AGT TTA CAC AAA CGG ACT TGT AAA AAC CTG CGG AGG TGG GGT CTA TGG CCA 

asn ala val ser leu his lys arg thr cys lys asn leu arg arg trp gly leu trp pro 

361/121 391/131 

ACA AAC GTG GCA ATG CCG GGC AGC CTC TGC CCT TGT CGG ATC 

thr asn val ala met pro gly ser leu cys pro cys arg ile^ 



SEQ ID ^ 73 3 _-, ss 
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1/1 31/11 
^oVlSt CGG TAA CGC CGC GTC CCA GTG CTA TCC GTC CGC CGG ACC GCC CGA AAC ATC AGC GGC GGG 
£Q&* arg OCH (arg arg val pro val leu ser val arg arg thr ala arg asn ile ser gly gly 

61/21 Vs^QIbWO'.l^T 91/31 

CGC CCC GGT CGG CCG CGG CCG GGC TCG ACC CGC TCC ACC TGG CCA TCA GCG ACC AGG TTA 
arg pro gly arg pro arg pro gly ser thr arg ser thr trp pro ser ala thr arg leu 
121/41 151/51 

TCG AGG TGG AAG CGG ACG GTG TTG GGA TGC ACG CCC AAC TTG CCG GCG ATC GCG GCG ATG 
ser arg trp lys arg thr val leu gly cys thr pro asn leu pro ala ile ala ala met 
181/61 211/71 

CTC ATC GGA ACC CGC GAC GCA CAC AAT GCC CGC AGC ACC GCA CGA CGG CGC CCC ACC GGC 
leu ile gly thr arg asp ala his asn ala arg ser thr ala arg arg arg pro thr gly 
241/81 271/91 

TCT TGC AGT GAC CTG ATG ATG ACA CTC ACC CCC ATA AGG CTC GTC GGC TGC GCC TGA GCA 
cys ser asp leu met met thr leu thr pro ile arg leu val gly cys alajoPA ala 



ser 



301/101 331/1 11 

ATG CAG TAA GTT TAC ACA AAC GGA CTT GTA AAA ACC TGC GGA GGT GGG GTC TAT GGC CAA 

met gin OCH (yal tyr thr asn gly leu val lys thr cys gly gly gly val tyr gly gin 

361/121 ^Se&TfcNC*!^ 391/131 

CAA ACG TGG CAA TGC CGG GCA GCC TCT GCC CTT GTC GGA. TC 

gin thr trp gin cys arg ala ala ser ala leu val gly^ 
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Coding sequence Rv2622 predicted by Cole et al., 1998 (Nature 393:537-544) and 
containing seq42A: 

1/1 31/11 

atg gcc aac aaa cgt ggc aat gcc ggg cag cct ctg ccc ttg teg gat cga gac gac gac 
■bVMf'T*^ Metala asn lys arg gly asn ala gly gin pro leu pro leu ser asp arg asp asp asp 
61/23^ 2>6G>:fcfcj ho: 7 vo 91/31 

cac atg cag ggg cac tgg ctg ctg gcc egg ctg ggc aag egg gtg ctg cgt ccc ggc ggc 
• his met gin gly his trp leu leu ala arg leu gly lys arg val leu arg pro gly gly 
121/41 151/51 

gtc gaa etc acc egg aca ctg ctg gcc cgc gcc gag gtg ace gac gcc gac gtg etc gag 
val glu leu thr arg thr leu leu ala arg ala glu val thr asp ala asp val leu glu 
181/61 211/71 

ctg gca ccg ggc ctg ggc cgc acc gca gcc gaa ate ttg gcc cgc aac ccg egg teg tac 
leu ala pro gly leu gly arg thr ala ala glu ile leu ala arg asn pro arg ser tyr 
241/81 271/91 

gtg ggg gcg gag age gat ccc aac gcg gcc aac ctg gtc cga cac gtt etc gcc ggc cgc 
val gly ala glu ser asp pro asn ala ala asn leu val arg his val leu ala gly arg 
301/101 331/111 

ggc gac gtc egg gtc acc gac gcg gcc gat acc gga tta tec gac gcc age gcc gat gtc 
qly asp val arg val thr asp ala ala asp thr gly leu ser asp ala ser ala. asp val 
361/121 391/131 

gtc ate ggc gag gcg atg ctg acc atg caa ggc aac gcg get aaa cac acg ate gtc gcc 
val ile gly glu ala met leu thr met gin gly asn ala ala lys his thr ile val ala 
■ " • 421/141 451/151 ' ' ' 

gag gcg gcg egg gtg ctg agg ccg ggt ggc cgc tac gcg att cac gaa eta gcg ctg gtg 
glu ala ala arg val leu arg pro gly gly arg tyr ala ile his glu leu ala leu val 
481/161 511/171 

ccg gac gac gtc gca gag cag gtc cgc acc gac ctg egg cag teg ctg gcc cgc gcg etc 
pro asp asp val ala glu gin val arg thr asp leu arg gin ser leu ala arg ala leu 
541/181 571/191 

aag gtc aat gcg cgt ccg ctg acc gtt gcg gaa tgg teg cac etc tta gcg ggc cat gga 
lys val asn ala arg pro leu thr val ala glu trp ser his leu leu ala gly his gly 
601/201 631/211 

ctg gtc gtc gaa cac gtt gtc acc get tec atg gcg ttg tta caa ccg cga egg gtg ate 
leu val val glu his val val thr ala ser met ala leu leu gin pro arg arg val lie 
661/221 691/231 

get gac gaa ggc etc ctg ggt gcg ctg egg ttc gcc gga aac ctg etc ate cat cgt gcc 
ala asp glu gly leu leu gly ala leu arg phe ala gly asn leu leu ile his arg ala 
721/241 751/251 

gcg cgt egg cga gtc ctg ttg atg cgc cac aca ttc cgc agg cat cgt gaa cgc ttg aca 

ala arg arg arg val leu leu met arg his thr phe arg arg his arg glu arg leu thr 

781/261 811/271 

gcc gtc gcc att gtc gcg cac aaa ccg cac gtc gat teg tga 

ala val ala ile val ala his lys pro his val asp ser)OPA 



seq id rv/o^: -J3«f- iqo 

FIGURE 4 2D 



139/185 



ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv2622 

1/1 31/11 
u{ taa aaa cct gcg gag gtg ggg tct atg gcc aac aaa cgt ggc aat gcc ggg cag cct ctg 
^O' 1 * OCHClys pro ala glu val gly ser met ala asn lys arg gly asn ala gly gin pro leu 
0K£ 61/21 SfftobNO' IHa 91/31 

ccc ttg teg gat cga gac gac gac cac atg cag ggg cac tgg ctg ctg gcc egg ctg ggc 
pro leu ser asp arg asp asp asp his met gin gly his trp leu leu ala arg leu gly 
121/41 151/51 

aag egg gtg ctg cgt ccc ggc ggc gtc gaa etc ace egg aca ctg ctg gcc cgc gcc gag 
lys arg val leu arg pro gly gly val glu leu thr arg thr leu leu ala arg ala glu 
181/61 211/71 

gtg acc gac gcc gac gtg etc gag ctg gca ccg ggc ctg ggc cgc acc gca gcc gaa ate 
val thr asp ala asp val leu glu leu ala pro gly leu gly arg thr ala ala glu ile 
241/81 271/91 

ttg gcc cgc aac ccg egg teg tac gtg ggg gcg gag age gat ccc aac gcg gcc aac ctg 
leu ala arg asn pro arg ser tyr val gly ala glu ser asp pro asn ala ala asn leu 
301/101 331/111 

gtc cga cac gtt etc gcc ggc cgc ggc gac gtc egg gtc acc gac gcg gcc gat acc gga 
val arg his val leu ala gly arg gly asp val arg val thr asp ala ala asp thr gly 
361/121 391/131 - . 

tta tec gac gcc age gcc gat gtc gtc ate ggc gag gcg atg ctg acc atg caa ggc aac 
leu ser asp ala ser ala asp val val ile gly glu ala met leu thr met gin gly asn 

421/141 . t 451/151 „ .. , , 

gcg get aaa cac acg ate gtc gcc gag gcg gcg egg gtg ctg agg ccg ggt ggc cgc tac 
ala ala lys his thr ile val ala glu ala ala arg val leu arg pro gly gly arg tyr 
481/161 511/171 

gcg att cac gaa eta gcg ctg gtg ccg gac gac gtc gca gag cag gtc cgc acc gac ctg 
ala ile his glu leu ala leu val pro asp asp val ala glu gin val arg thr asp leu 
541/181 571/191 

egg cag teg ctg gcc cgc gcg etc aag gtc aat gcg cgt ccg ctg acc gtt gcg gaa tgg 
arg gin ser leu ala arg ala leu lys val asn ala arg pro leu thr val ala glu trp 
601/201 631/211 

teg cac etc tta gcg ggc cat gga ctg gtc gtc gaa cac gtt gtc acc get tec atg gcg 
ser his leu leu ala gly his gly leu val val glu his val val thr ala ser met ala 
661/221 691/231 

ttg tta caa ccg cga egg gtg ate get gac gaa ggc etc ctg ggt gcg ctg egg ttc gcc 
leu leu gin pro arg arg val ile ala asp glu gly leu leu gly ala leu arg phe ala 
721/241 751/251 

gga aac ctg etc ate cat cgt gcc gcg cgt egg cga gtc ctg ttg atg cgc cac aca ttc 
gly asn leu leu ile his arg ala ala arg arg arg val leu leu met arg his thr phe 
781/261 811/271 

cgc agg cat cgt gaa cgc ttg aca gcc gtc gcc att gtc gcg cac aaa ccg cac gtc gat 

arg arg his arg glu arg leu thr ala val ala ile val ala his lys pro his val asp 

841/281 

teg tga 

ser^OPA 



SEQ ID No5:7^/-7^ 
FIGURE 42E 



140/185 



1/1 31/11 
, ate gcg cgt gac ate gat gac cag ggt egg ctg tgt ctg gac gtc ggc ggt cga acg gta 

5>6RJ^ n^l e ala ar 9 as P ile as P asp gln gly arg leu cys leu asp val gly gly arg thr Val 

«^V40MWH 7 61/21 91 / 31 

5eR3 ^ gtt gtt tea gcg ggc gac gtg gtg cat ttg cgt taa etc gcg egg age tgg cgt ccc caa 

val val ser ala qly asp val val his leu argJOCHfleu ala arg ser trp arg pro gin 

121/41 1S1V51 v ^saxfcNo^HS 

aag att aag gtc gcg ggc atg age tat ccg gag aat gtc ctg gec get ggc gag cag gtc 
lys ile lys val ala gly met ser tyr pro glu asn val leu ala ala gly glu gin val 
181/61 211/71 

gtt ctg cac cgc cat ccg cac tgg aat cgc tta ate tgg ccc gtc gtg gtg ctg gtc ttg 
val leu his arg his pro his trp asn arg leu ile trp pro val val val leu val leu 
241/81 271/91 

ctg ace ggg ttg gcg gcg ttc ggg tec gga ttc gtc aac teg aca cct tgg cag cag ate 
leu thr gly leu ala ala phe gly ser gly phe val asn ser thr pro trp gin gin ile) 

seq id mds: iH?>-m5 

FIGURE 43A 



1/1 31/11 
e.gSlXb^^^ teg cgc gtg aca teg atg ace agg gtc ggc tgt gtc tgg acg teg gcg gtc gaa egg tag 
* /ser arg val thr ser met thr arg val gly cys val trp thr ser ala val glu argJAMB 

61/21>*-S^*fc 1H1 91/31 

ttg ttt cag egg gcg acg tgg tgc att tgc gtt aac teg cgc gga get ggc gtc ccc aaa 
jfleu phe gin arg ala thr trp cys ile cys val asn ser arg gly ala gly val pro 



lys 



^O^* 121/41 151/51 



s^ 5 ^ aga tta agg teg egg gca tga get ate egg aga atg tec tgg ccg ctg gcg age agg teg 

arg leu arg ser arg ala ^OPA (a^ a ile arg arg met ser trp pro leu ala ser arg ser 
181/61 <rs£QXfctIoV7M? 211/71 

ttc tgc ace gee ate cgc act gga ate get taa tct ggc ccg teg tgg tgc tgg tct tgc 
phe cys thr ala ile arg thr gly ile ala/OCH ^er gly pro ser trp cys trp ser cyj) 
241/81 271/91*»ft3rftVlo:-7SO 

tga ccg ggt tgg egg cgt teg ggt ccg gat teg tea act cga cac ctt ggc age aga tc 
OPA £ro gly trp arg arg ser gly pro asp ser ser thr arg his leu gly ser argj 



SEQ ID NDS . m^ l5( 



FIGURE 43B 



141/185 



1/1 31/11 

cgc gcg tga cat cga tga cca ggg teg get gtg tct gga cgt egg egg teg aac ggt agt 
^>iO % *^^ arg ala OPA his arg OPA^pro gly ser ala val ser gly arg arg arg ser asn gly ser 
0^ 61/21 SCQtCDNOnS^ 91/31 

tgt ttc age ggg cga cgt ggt gca ttt gcg tta act cgc gcg gag ctg gcg tec cca aaa 
cys phe ser gly arg arg gly ala phe ala leu thr arg ala glu leu ala ser pro lys 
121/41 151/51 

gat taa ggt cgc ggg cat gag eta tec gga gaa tgt cct ggc cgc tgg cga gca ggt cgt 
asp) OCH(gly arg gly his glu leu ser gly glu cys pro gly arg trp arg ala gly arg 
18f/61 VsgsabMO*.7SW 211/71 

tct gca ccg cca tec gca ctg gaa teg ctt aat ctg gee cgt cgt ggt get ggt ctt get 
ser ala pro pro ser ala leu glu ser leu asn leu ala arg arg gly ala gly leu ala 
241/81 271/91 

gac egg gtt ggc ggc gtt egg gtc egg att cgt caa etc gac acc ttg gca gca gat c 
asp arg val gly gly val arg val arg ile arg gin leu asp thr leu ala ala asp) 



SEQ ID Alo5.*n 52-15^ 
FIGURE 43C 



Coding sequence Rv3278c predicted by Cole et al., 1998 (Nature 393:537-544) and 
containing seq4 3A: 

1/1 31/11 

gtt ctg cac cgc cat ccg 



val leu his arg his pro 



^A-T&tJOlTSS atg age tat ccg gag aat gtc ctg gee get ggc gag cag gtc 

*^ n</ (Met ser tyr pro glu asn val leu ala ala gly glu gin val 

cgtft^' -15 ^ 61/21 91/31 

cac tgg aat cgc tta ate tgg ccc gtc gtg gtg ctg gtc ttg ctg acc ggg ttg gcg gcg 
his trp asn arg leu ile trp pro val val val leu val leu leu thr gly leu ala ala 
121/41 151/51 

ttc ggg tec gga ttc gtc aac teg aca cct tgg cag cag ate get aag aac gtg att cac 
phe gly ser gly phe val asn ser thr pro trp gin gin ile ala lys asn val ile his 
181/61 211/71 

gcg gtc ate tgg ggg ate tgg ttg gtg ate gtc ggc tgg etc acg ctg tgg cca ttc ctg 
ala val ile trp gly ile trp leu val ile val gly trp leu thr leu trp pro phe leu 
241/81 271/91 

age tgg ctg acc aca cat ttc gtg gtg acc aac egg egg gtg atg ttc egg cat ggt gtg 
ser trp leu thr thr his phe val val thr asn arg arg val met phe arg his gly val 
301/101 331/111 

ctg acc cgc age ggg ate gac ata ccg eta gca egg ate aac age gtg gag ttc egg gac 
leu thr arg ser gly ile asp ile pro leu ala arg ile asn ser val glu phe arg asp 
361/121 391/131 

egg ate ttc gag egg att ttt cgc acc ggg acg ttg att ate gag tec gcg tea caa gat 
arg ile phe glu arg ile phe arg thr gly thr leu ile ile glu ser ala ser gin asp 

421/141 _ 451/151 

ccg etc gag ttc tac aac att ccg cgc ctg egg gag gtg cat gcg ttg ctg tat cac gag 
pro leu glu phe tyr asn ile pro arg leu arg glu val his ala leu leu tyr his glu 
481/161 511/171 
gtt ttc gac acc ctg ggc tec gac gag teg ccc age tga 
val phe asp thr leu gly ser asp glu ser pro ser) OPA 



SEQ id ^051 755-7 56 

FIGURE 4 3D 



142/185 



ORF according to Cole et al . , 1998 (Nature 393:537-544) and containing Rv3278c 
1/1 31/11 

taa etc gcg egg age tgg cgt ccc caa aag att aag gtc gcg ggc atg age tat ccg gag 
OCH (leu ala arg ser trp arg pro gin lys ile lys val ala gly met ser tyr pro glu 
61/21 V &eQ3fcNt>nSB 91/31 

aat gtc ctg gec get ggc gag cag gtc gtt ctg cac cgc cat ccg cac tgg aat cgc tta 
asn val leu ala ala gly glu gin val val leu his arg his pro his trp asn arg leu 
121/41 151/51 

ate tgg ccc gtc gtg gtg ctg gtc ttg ctg acc ggg ttg gcg gcg ttc ggg tec gga ttc 
ile trp pro val val val leu val leu leu thr gly leu ala ala phe gly ser gly phe 
181/61 211/71 

gtc aac teg aca cct tgg cag cag ate get aag aac gtg att cac gcg gtc ate tgg ggg 
val asn ser thr pro trp gin gin ile ala lys asn val ile his ala val ile trp gly 
241/81 271/91 

ate tgg ttg gtg ate gtc ggc tgg etc acg ctg tgg cca ttc ctg age tgg ctg acc aca 
ile trp leu val ile val gly trp leu thr leu trp pro phe leu ser trp leu thr thr 
301/101 331/111 

cat ttc gtg gtg acc aac egg egg gtg atg ttc egg cat ggt gtg ctg acc cgc age ggg 
his phe val val thr asn arg arg val met phe arg his gly val leu thr arg ser gly 
361/121 391/131 

ate gac ata ccg eta gca egg' ate aac age gtg gag ttc egg gac cg$ ate ttc gag egg 
ile asp ile pro leu ala arg ile asn ser val glu phe arg asp arg ile phe glu arg 
421/141 451/151 

att ttt cgc acc ggg acg ttg att ate gag tec gcg tea caa gat ccg etc gag ttc tac 
ile phe arg thr gly thr leu ile ile glu ser ala ser gin asp pro leu glu phe tyr 
481/161 511/171 

aac att ccg cgc ctg egg gag gtg cat gcg ttg ctg tat cac gag gtt ttc gac acc s ctg 
asn ile pro arg leu arg glu val his ala leu leu tyr his glu val phe asp thr leu 
541/181 

ggc tec gac gag teg ccc age tga 
gly ser asp glu ser pro ser) OPA 



seq id a/o&: 151-^5^ 

FIGURE 43E 



143/185 



1/1 


















31/11 


















gcc aag 


atg 


gat 


gtc 


tac 


caa 


cgc 


acc 


gcc 


gcc ggc 


tgg 


cag 


ccg 


etc 


aag 


acc 


ggt 


ate 


r ala lys 


met 


asp 


val 


tyr 


gin 


arg 


thr 


ala 


ala gly 


trp 


gin 


pro 


leu 


lys 


thr 


gly 


ile 


61/21 


















91/31 


















acc acc 


cat 


ate 


ggt 


teg 


gcg 


ggc 


atg 


gcg 


ccg gaa 


gcc 


aag 


age 


gga 


tat 


ccg 


gcc 


act 


thr thr 


his 


ile 


gly 


ser 


ala 


gly 


met 


ala 


pro glu 


ala 


lys 


ser 


gly 


tyr 


pro 


ala 


thr 


121/41 


















151/51 


















ccg atg 


ggg 


gtt 


tac 


age 


ctg 


gac 


tec 


get 


ttt ggc 


acc 


gcg 


ccg 


aat 


ccc 


ggt 


ggc 


ggg 


pro met 


gly 


val 


tyr 


ser 


leu 


asp 


ser 


ala 


phe gly 


thr 


ala 


pro 


asn 


pro 


gly 


gly 


gly 


181/61 


















211/71 


















ttg ccg 


tat 


acc 


caa 


gtc 


gga 


ccc 


aat 


cac 


tgg tgg 


agt 


ggc 


gac 


gac 


aat age 


ccc 


acc 


leu pro 


tyr 


thr 


gin 


val 


gly 


pro 


asn 


his 


trp trp 


ser 


gly 


asp 


asp 


asn 


ser 


pro 


thr 


241/81 


















271/91 


















ttt aac 


tec 


atg 


cag 


gtc 


tgt 


cag 


aag 


tec 


cag tgc 


ccg 


ttc 


age 


acg 


gcc 


gac 


age 


gag 


phe asn 


ser 


met 


gin 


val 


cys 


gin 


lys 


ser 


gin cys 


pro 


phe 


ser 


thr 


ala 


asp 


ser 


glu 


301/101 


















331/111 


















aac ctg 


caa 


ate 


ccg 


cag 


tac 


aag 


cat 


teg 


gtc gtg 


atg 


ggc 


gtc 


aac 


aag 


gcc 


aag 


gtc 


asn leu 


gin 


ile 


pro 


gin 


tyr 


lys 


his 


ser 


val val 


met 


gly 


val 


asn 


lys 


ala 


lys 


val 


361/121 


















391/131 


















cca ggc 


aaa 


ggc 


tec 


gcg 


ttc 


ttc 


ttt 


cac 


acc acc 


gac 


ggc 


ggg 


ccc 


acc 


gcg 


ggt 


tgt 


pro gly 


lys 


gly 


ser 


ala 


phe 


phe 


phe 


his 


thr thr 


asp 


gly 


gly pro 


thr 


ala 


gly 


cys 


421/141 




































gtg gcg 


ate 


































val ala 


ile" 



































seq id MOS'. 159 k° 

FIGURE 44A 





1/1 




















31/1 


.1 




















cca 


aga 


tgg 


atg 


tct 


acc 


aac 


gca 


ccg 


ccg 


ccg 


get 


ggc 


age 


cgc 


tea 


aga 


ccg 


gta 


tea 


^pro 


arg 


trp 


met 


ser 


thr 


asn 


ala 


pro 


pro 


pro 


ala 


gly 


ser 


arg 


ser 


arg 


pro 


val 


ser 




61/21 
















91/31 




















cca 


ccc 


ata 


teg 


gtt 


egg 


egg 


gca 


tgg 


cgc 


egg 


aag 


cca 


aga 


gcg 


gat 


ate 


egg 


cca 


etc 




pro 


pro 


ile 


ser 


val 


arg 


arg 


ala 


trp 


arg 


arg 


lys 


pro 


arg 


ala 


asp 


ile 


arg 


pro 


leu 




121/41 


















151/51 




















cga 


tgg 


ggg 


ttt 


aea 


gcc 


tgg 


act 


ccg 


ctt 


ttg 


gca 


ccg 


cgc 


cga 


ate 


ccg 


gtg 


gcg 


ggt 




arg 


trp 


gly 


phe 


thr 


ala 


trp 


thr 


pro 


leu 


leu 


ala 


pro 


arg 


arg 


ile 


pro 


val 


ala 


gly 




181/61 


















211/71 




















tgc 


cgt 


ata 


ccc 


aag 


teg 


gac 


cca 


ate 


act 


ggt 


gga 


gtg 


gcg 


acg 


aea 


ata 


gcc 


cca 


cct 




cys 


arg 


ile 


pro 


lys 


ser 


asp 


pro 


ile 


thr 


gly 


gly 


val 


ala 


thr 


thr 


ile 


ala 


pro 


pro 




241/81 


















271/91 




















tta 


act 


cca 


tgc 


agg 


tct 


gtc 


aga 


agt 


ccc 


agt 


gcc 


cgt 


tea 


gca 


egg 


ccg 


aca 


gcg 


aga 




leu 


thr 


pro 


cys 


arg 


ser 


val 


arg 


ser 


pro 


ser 


ala 


arg 


ser 


ala 


arg 


pro 


thr 


ala 


arg 




301/101 


















331/111 



















acc tgc aaa tec cgc agt aca age att egg tcq tga tgg gcg tea aca agg cca agg tec 

thr cys lys ser arg ser thr ser ile arg ser)oPA(trp ala ser thr arg pro arg ser 
361/121 391/131 >$£«3rbNC>nCo3 

cag gca aag get ccg cgt tct tct ttc aca cca ccg acg gcg ggc cca ccg egg gtt gtg 

gin ala lys ala pro arg ser ser phe thr pro pro thr ala gly pro pro arg val val 
421/141 
tgg cga tc 
trp arg) 



SEQ ID H0S 1 .1Ccl-'?(«3 
FIGURE 44B 



144/185 



1/1 31/11 
£2t-:CM0- 1&C{ caa g a t gga tgt eta cca acg cac cgc cgc egg ctg gca gee get caa gac egg tat cac 
jk/gln asp gly cys leu pro thr his arg arg arg leu ala ala ala gin asp arg tyr his 
^61/21 91/31 
^C?- cac cca tat egg ttc ggc ggg cat ggc gec gga age caa gag egg ata tec ggc cac tec 
gO&> his pro tyr arg p he gly gly his gly ala gly ser gin glu arg ile ser gly his ser 

121/41 151/51 

gat ggg ggt tta cag cct gga etc cgc ttt tgg cac cgc gee gaa tec egg tgg egg gtt 
asp gly gly leu gin pro gly leu arg phe trp his arg ala glu ser arg trp arg val 
181/61 211/71 

gee gta tac cca agt egg ace caa tea ctg gtg gag tgg cga cga caa tag ccc cac ctt 
ala val tyr pro ser arg thr gin ser leu val glu trp arg arg gln)AMB pro his leu 
241/81 271/91 

taa etc cat gca ggt ctg tea gaa gtc cca gtg ccc gtt cag cac ggc cga cag cga gaa 
OCHO-eu his ala gly leu ser glu val pro val pro val gin his gly arg gin arg glu 
301/lOlV <?&R^t>NO\76><r 331/111 

cct gca aat ccc gca gta caa gca ttc ggt cgt gat ggg cgt caa caa ggc caa ggt ccc 
pro ala asn pro ala val gin ala phe gly arg asp gly arg gin gin gly gin gly pro 
361/121 391/131 

agg caa agg etc cgc gtt ctt ctt tea cac cac cga egg egg c cac cgc ggg ttg tgt 
arg gin arg leu arg val leu leu ser his his arg arg arg a.. his arg gly leu cys 
421/141 
ggc gat c 

gly asp} ' " ' 



SEQ ID No AfOS'.l&M- 7&<b 
FIGURE 44C 



145/185 



Coding sequence Rv0309 predicted by Cole et al . , 1998 (Nature 393:537-544) and 
containing Seq44A: 

1/1 31/11 
utf/U»* *^atg age cga etc eta get ttg ctg tgc get gcg gta tgc acg ggc tgc gtt get gtg gtt 

t a NMet ser arg leu leu ala leu leu cys ala ala val cys thr gly cys val ala val val 

^ * 61/21 91/31 

etc gcg cca gtg age ctg gec gtc gtc aac ccg tgg ttc gcg aac teg gtc ggc aat gee 
leu ala pro val ser leu ala val val asn pro trp phe ala asn ser val gly asn ala 
121/41 151/51 

act cag gtg gtt teg gtg gtg gga acc ggc ggt teg acg gee aag atg gat gtc tac caa 
thr gin val val ser val val gly thr gly gly ser thr ala lys met asp val tyr gin 
181/61 211/71 

cgc acc gee gee ggc tgg cag ccg etc aag acc ggt ate acc acc cat ate ggt teg gcg 
arg thr ala ala gly trp gin pro leu lys thr gly ile thr thr his ile gly ser ala 
241/81 271/91 

ggc atg gcg ccg gaa gee aag age gga tat ccg gec act ccg atg ggg gtt tac age ctg 
gly met ala pro glu ala lys ser gly tyr pro ala thr pro met gly val tyr ser leu 
301/101 331/111 

gac tec get ttt ggc acc gcg ccg aat ccc ggt ggc ggg ttg ccg tat acc caa gtc gga 
asp ser ala phe gly thr ala pro asn pro gly gly gly leu pro tyr thr gin val gly 
361/121 391/131 

ccc aat cac tgg tgg agt ggc gac gac aat age ccc acc ttt aac tec atg cag gtc tgt 
pro asn his trp trp ser gly asp asp asn ser pro thr phe asn ser met gin val cys 
"■ ■'■ " " "421/141 " ' ■' 451/151 

cag aag tec cag tgc ccg ttc age acg gee gac age gag aac ctg caa ate ccg cag tac 
gin lys ser gin cys pro phe ser thr ala asp ser glu asn leu gin ile pro gin tyr 
481/161 511/171 

aag cat teg gtc gtg atg ggc gtc aac aag gee aag gtc cca ggc aaa ggc tec gcg ttc 
lys his ser val val met gly val asn lys ala lys val pro gly lys gly ser ala phe 
541/181 571/191 

ttc ttt cac acc acc gac ggc ggg ccc acc gcg ggt tgt gtg gcg ate gac gat gee acg 
phe phe his thr thr asp gly gly pro thr ala gly cys val ala ile asp asp ala thr 
601/201 631/211 

ctg gtg cag ate ate cgt tgg ctg egg cct ggt gcg gtg ate gcg ate gee aag taa 
leu val gin ile ile arg trp leu arg pro gly ala val ile ala ile ala lys^ OCH 



seq id HD$: 1^1" T 

FIGURE 44D 



146/185 



ORF according to Cole et al., 1998 (Nature 393:537-544) and containing Rv0309 



1/1 

^O'^^^tga gcg atg age cga etc eta get ttg 
^ZO^ OPA(ala met ser arg leu leu ala leu 

61/21 ^TOXbND'.TlD 

gtg gtt etc gcg cca gtg age ctg gee 
val val leu ala pro val ser leu ala 
121/41 

aat gee act cag gtg gtt teg gtg gtg 
asn ala thr gin val val ser val val 
181/61 

tac caa cgc acc gee gee ggc tgg cag 
tyr gin arg thr ala ala gly trp gin 
241/81 

teg gcg ggc atg gcg ccg gaa gee aag 
ser ala gly met ala pro glu ala lys 
301/101 

age ctg gac tec get ttt ggc acc gcg 
ser leu asp ser ala phe gly thr ala 
361/121 

gtc gga ccc aat cac tgg tgg agt ggc 
val gly pro asn his trp trp ser gly 
421/141 

gtc tgt cag aag tec cag tgc ccg ttc 
val cys gin lys ser gin cys pro phe 
481/161 

cag tac aag cat teg gtc gtg atg ggc 
gin tyr lys his ser val val met gly 
541/181 

gcg ttc ttc v ttt cac acc acc gac ggc 
ala phe phe phe his thr thr asp gly 
601/201 

gee acg ctg gtg cag ate ate cgt tgg 

ala thr leu val gin ile ile arg trp 

661/221 

taa 

OCH 



31/11 

ctg tgc get gcg gta tgc acg ggc tgc gtt get 
leu cys ala ala val cys thr gly cys val ala 
91/31 

gtc gtc aac ccg tgg ttc gcg aac teg gtc ggc 
val val asn pro trp phe ala asn ser val gly 
151/51 

gga acc ggc ggt teg acg gee aag atg gat gtc 
gly thr gly gly ser thr ala lys met asp val 
211/71 

ccg etc aag acc ggt ate acc acc cat ate ggt 
pro leu lys thr gly ile thr thr his ile gly 
271/91 

age gga tat ccg gee act ccg atg ggg gtt tac 
ser gly tyr pro ala thr pro met gly val tyr 
331/111 

ccg aat ccc ggt ggc ggg ttg ccg tat acc caa 
pro asn pro gly gly gly leu pro tyr thr gin 
391/131 

gac gac aat age ccc acc ttt aac tec atg cag 
asp asp asn ser pro thr phe asn ser met gin 
451/151 

age acg gee gac age gag aac ctg caa ate ccg 
ser thr ala asp ser glu asn leu gin ile pro 
511/171 

gtc aac aag gee aag gtc cca ggc aaa ggc tec 
val asn lys ala lys val pro gly lys gly ser 
571/191 

ggg ccc acc gcg ggt tgt gtg gcg ate gac gat 
gly pro thr ala gly cys val ala ile asp asp 
631/211 

ctg egg cct ggt gcg gtg ate gcg ate gee aag^ 
leu arg pro gly ala val ile ala ile ala lys 



SEQ ID lUt>S:*7^M7D 
FIGURE 44F 
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Cloned fragment fused with phoA 

1/1 31/11 

•nT\— 3 > gat: ctc ccc gga cac cag gtc atc cgg cga gat ggt gat cga ggc tcg gac ccg cag gca 
*/asp leu pro gly his gin val ile arg arg asp gly asp arg gly ser asp pro gin ala 

•^^.^ 61/21 91/31 

fv"355> tcc 99t age eag agg cac cag cat cag caa cat cgc gat ggc cag cat gec gcg ccg tcg 

ser gly ser gin arg his gin his gin gin his arg asp gly gin his ala ala pro ser 

121/41 151/51 

ggt cct tgc cac tcg cga tcc ttg gga tga egg tgg ggc ata get age gcg cac cag gtc 
gly pro cys his ser arg ser leu gly^OPA^arg trp gly ile ala ser ala "his gin val 
181/61 XQ^hxiCT^^ 211/71 

atc gtg cca gac egg gca tgc cgc gtc ggc aag ctg tcg ggc gcg ggt tag age ggt age 
ile val pro asp arg ala cys arg val gly lys leu ser gly ala gly^AMB^ser gly ser 
241/81 271/91 ^CXOHtr.llH 

gtg cga ccc agg atg gcg aat get egg ggg tea ccg gcg aag tgg tag ccg egg atg atg 
val arg pro arg met ala asn ala arg gly ser pro ala lys trp)AMB(pro arg met met 
301/101 331/111 ^QJbi«l75 

tcg gtg aag ccc aac egg egg tac aac cgc cac gee cga ttg tcc tea ccg ttg gtc tcc 
ser val lys pro asn arg arg tyr asn arg his ala arg leu ser ser pro leu val ser 
361/121 391/131 

ggt gtg gag age agg acg ttg tcc tcg tcg cga ccg get age agt egg egg gec aac gec 
gly val glu ser arg thr leu ser ser ser arg pro ala ser ser arg arg ala asn ala 
421/141 451/151 

tcc ccg agg cca egg cct tga gcg egg gga agg atg tgc aat tea gtc aac tcg aag tag 
* ' - 4 - ser' pro arg' pro arg pro) OPAjfala ' arg gly arg met cys asn ser val asn ser'lys^AMB 

481/161 ^6fS3wo\TlU 511/171 

ctg gtc atc agt egg gcg atc get agg cgc gga aag ccg ctg cgt tgc aag ccc agt ace 
-*_»(leu val ile ser arg ala ile ala arg arg gly lys pro leu arg cys lys pro ser thr 
.;0 X 541/181 571/191 

ace tgc tgt tgc cac cac tgg ccg ggc gee ccg gga tag ccg tac gec act ccg age att 
thr cys cys cys his his trp pro gly ala pro gly/AMB/pro tyr ala thr pro ser ile 
0 601/201 631/211 V^QXlifionn 

ggc gcg ttg ctc agt tcg gcg gee gac ggc age gec gtg gtg tcg gcg gec tcg gee tgt 
gly ala leu leu ser ser ala ala asp gly ser ala val val ser ala ala ser ala cys 
661/221 691/231 

tcg get gec gtt acc tcg acg gee gcg acc gec tgc cag ccg cgc cgc egg atg tgc tcc 
ser ala ala val thr ser thr ala ala thr ala cys gin pro arg arg arg met cys ser 
721/241 751/251 

age cac att ggg gcg cgc aaa gtc tcg gtg ccc ctg ggg tag cgc atc gcg tcg aca tac 
ser his ile gly ala arg lys val ser val pro leu gly)AMB (arg ile ala ser thr tyr 
781/261 811/271 1W 

acc gtc agg gca tea ccg agg egg cgc tcc ata tcg ctg ggc ggc aga tcg atg agg aat 
thr val arg ala ser pro arg arg arg ser ile ser leu gly gly arg ser met arg asn 
841/281 871/291 

atc gee aac gcg egg tgt cct cct cat gtg atg aac cga tgc gtg ctt gcg cac cag tat 
ile ala asn ala arg cys pro pro his val met asn arg cys val leu ala his gin tyr 
901/301 931/311 

— _ G gg_ aca agc^ cga^tga-ggc~agc-ccg^cgc_tgg™acg ggg_ctt.„gta„ gcg„ tat. _ggc_ _cg.tL.ttc cgc 

arg thr ser arg^OPAfaly arg pro arg trp thr gly leu val ala tyr gly arg phe arg 

56Q ia> ho: tf£> 



seq id ja<xs: -77/-7tf2> 

FIGURE 45ZA 
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961/321 991/331 

tea get cgt cgc tgc ggc gec gec ggg ata gaa teg ccc gcg aac cag tgg tac ggc gca 
ser ala arg arg cys gly ala ala gly ile glu ser pro ala asn gin trp tyr gly ala 
1021/341 1051/351 

gat tga cct cgt ate ate tga gtt agt tgc ccg cgc aat ggg cat ccg cgt gtt ate ggt 
asp)oPA(pro arg ile ile)OPA/val ser cys pro arg asn gly his pro arg val ile gly 
1081/361 Hec^ND*.™ i^tOMO'.lW 1111/371 

att acg tga cag tct gtc ggc aag gag gga cgc atg cca etc tec gat cat gag cag egg 
ile thr)OPA/gln ser val gly lys glu gly arg met pro leu ser asp his glu gin arg 

1141/381 Seiabwomi 1171/391 

atg ctt gac cag ate gag age get etc tac gec gaa gat ccc aag ttc gca teg agt gtc 
met leu asp gin ile glu ser ala leu tyr ala glu asp pro lys phe ala ser ser val 
1201/401 1231/411 

cgt ggc ggg ggc ttc cgc gca ccg ace gcg egg egg cgc ctg cag ggc gcg gcg ttg ttc 
arg gly gly gly phe arg ala pro thr ala arg arg arg leu gin gly ala ala leu phe 
1261/421 1291/431 

ate ate ggt ctg ggg atg ttg gtt tec ggc gtg gcg ttc aaa gag acc atg ate gga agt 
ile ile gly leu gly met leu val ser gly val ala phe lys glu thr met ile gly ser 
1321/441 1351/451 

ttc ccg ata etc age gtt ttc ggt ttt gtc gtg atg ttc ggt ggt gtg gtg tat gee ate 
phe pro ile leu ser val phe gly phe val val met phe gly gly val val tyr ala ile 
1381/461 1411/471 

acc ggt cct egg ttg tec ggc agg atg gat cgt ggc gga teg get get ggg get teg cgc 
thr gly pro arg leu ser gly arg met asp arg gly gly ser ala ala gly ala ser arg 
1441/481 ' ' 1471/491 ' 

cag cgt cgt acc aag ggg gee ggg ggc tea ttc acc age cgt atg gaa gat c 
gin arg arg thr lys gly ala gly gly ser phe thr ser arg met glu asr^ 



SEQ ID NASmi- (continued) 
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fragment seq45ZA shifted minus 1 for the reading frame 
l/l^* 0 ^- 1 ^ 31/11 

ate tec ccg gac acc agg tea tec ggc gag atg gtg ate gag get egg acc cgc agg cat 
jhile ser pro asp thr arg ser ser gly glu met val ile glu ala arg thr arg arg his 
61/21 91/31 

ccg gta gcc aga ggc acc age ate age aac ate gcg atg gcc age atg ccg cgc cgt egg 
f^' pro val ala arg gly thr ser ile ser asn ile ala met ala ser met pro arg arg arg 

121/41 151/51 

gtc ctt gcc act cgc gat cct tgg gat gac ggt ggg gca tag eta gcg cgc acc agg tea 
val leu ala thr arg asp pro trp asp asp gly gly alaAAMB (leu ala arg thr arg ser 
18 1/ 6 1 211/71 ' v V^ & :«>HD-.7*6> 

teg tgc cag acc ggg cat gcc gcg teg gca age tgt egg gcg egg gtt aga gcg gta gcg 
ser cys gin thr gly his ala ala ser ala ser cys arg ala arg val arg ala val ala 
241/81 271/91 

tgc gac cca gga tgg cga atg etc ggg ggt cac egg cga agt ggt age cgc gga tga tgt 
cys asp pro gly trp arg met leu gly gly his arg arg ser gly ser arg glylOPA cys 
301/101 331/111 

egg tga age cca acc ggc ggt aca acc gcc acg ccc gat tgt cct cac cgt tgg tct ccg 
arg OPA(ser pro thr gly gly thr thr ala thr pro asp cys pro his arg trp ser pro 
361/121 J~$eft*t>N£>;i*l 391/131 

gtg tgg aga gca gga cgt tgt cct cgt cgc gac egg eta gca gtc ggc ggg cca acg cct 
val trp arg ala gly arg cys pro arg arg asp arg leu ala val gly gly pro thr pro 
421/141 451/151 

ccc cga ggc cac ggc ctt gag cgc ( ggg gaa gga tgt gca att cag tea act cga agt age 
pro arg gly his gly leu glu arg' gly glu gly cys ala ile gin ser ' thr arg ser ser 
481/161 511/171 

tgg tea tea gtc ggg cga teg eta ggc gcg gaa age cgc tgc gtt gca age cca gta cca 
trp ser ser val gly arg ser leu gly ala glu ser arg cys val ala ser pro val pro 
541/181 571/191 

cct get gtt gcc acc act ggc egg gcg ccc egg gat age cgt acg cca etc cga gca ttg 
pro ala val ala thr thr gly arg ala pro arg asp ser arg thr pro leu arg ala leu 
601/201 631/211 

gcg cgt tgc tea gtt egg egg ccg acg gca gcg ccg tgg tgt egg egg cct egg cct gtt 
ala arg cys ser val arg arg pro thr ala ala pro trp cys arg arg pro arg pro val 
661/221 691/231 

egg ctg ccg tta cct cga egg ccg cga ccg cct gcc age cgc gcc gcc gga tgt get cca 
arg leu pro leu pro arg arg pro arg pro pro ala ser arg ala ala gly cys ala pro 
721/241 751/251 

gcc aca ttg ggg cgc gca aag tct egg tgc ccc tgg ggt age gca teg cgt cga cat aca 
ala thr leu gly arg ala lys ser arg cys pro trp gly ser ala ser arg arg his thr 
781/261 811/271 

ccg tea ggg cat cac cga ggc ggc get cca tat cgc tgg gcg gca gat cga tga gga ata 
pro ser gly his his arg gly gly ala pro tyr arg trp ala ala asp arg/OPAfgly 
841/281 871/291 v *eOxbW>n** 

teg cca acg cgc ggt gtc etc etc atg tga tga acc gat gcg tgc ttg cgc acc agt ate 
ser pro thr arg gly val leu leu met) OP A OPA^thr asp ala cys leu arg thr ser lie 
901/301 ' 931/311>s<cQT:OW^^ 

_ gg- a — ca - a gcc-gat-gag-gcc-gec-cge-gct- -gga egg ggc--ttg-tag^cgt-atg^gcc_gt.t_tcc_ get 

qlv gin ala asp glu ala ala arg ala gly arg gly leu) AMB (arg met ala val ser ala 
961/321 991/331 ' V&XfrHtt W 

cag etc gtc get gcg gcg ccg ccg gga tag aat cgc ccg cga acc agt ggt acg gcg cag 
gin leu val ala ala ala pro pro gly} AMB (asn arg pro arg thr ser gly thr ala gin 



seq id m&s: 7<24- 
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1021/341 1051/351 

att qac etc gta tea tct gag tta gtt gec cgc gca atg ggc ate cgc gtg tta teg gta 
ile aso leu val ser ser glu leu val ala arg ala met gly ile arg val leu ser val 

1081/361 1111/371 
tta cat gac agt ctg teg gca agg agg gac gca tgc cac tct ccg ate atg age age gga 
leu arq asp ser leu ser ala arg arg asp ala cys his ser pro ile met ser ser gly 
1141/381 1171/391 

tac tta ace aga teg aga gcg etc tct acg ccg aag ate cca agt teg cat cga gtg tec 
cvs leu thr arg ser arg ala leu ser thr pro lys ile pro ser ser his arg val ser 
1201/401 1231/411 

ota qcg gag get tec gcg cac cga ccg cgc ggc ggc gee tgc agg gcg egg cgt tgt tea 
val ala gly ala ser ala his arg pro arg gly gly ala cys arg ala arg arg cys ser 
1261/421 1291/431 

tea teg gtc tgg gga tgt tgg ttt ccg gcg tgg cgt tea aag aga ccatga teg gaa gtt 
ser ser val trp gly cys trp phe pro ala ^arg^ser lys arg P-^K^Q ^O'. \<i 1. 
telega 1 tac tea gcg ttt teg gtt ttg teg tga tgt teg gtg gtg tgg tgt atg cca tea 

ii 8 i g 6i tyr ser ala phe ser val leu ""^StelJSiS-NS cys met pro ser 

ccg gtc etc ggt tgt eeg gca gga tgg ate gtg gcg gat egg ctg ctg ggg ctt cgc gee 
pro val leu gly cys pro ala gly trp ile val ala asp arg leu leu gly leu arg ala 
1441/481 1471/491 
, age gtc gta cca agg ggg ccg ggg get cat tea cca gee gta tgg aag ate 
ser val val pro arg gly pro gly ala His ser pro ala val trp lys lie) 



SEQ ID NCfclS'J^ (continued) 
FIGURE 45ZB (continued) 
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fragment seq45ZA shifted minus 2 for the reading frame 

^-»rsirs 4 ^ 2 2 s s %&$szF - = - - 

^ /» - - «- - - is 2 2 2 2 2 2 2 2 2 



«Ji»A(pro glu .1. pro .1. .« .1. thr aer arg trp pro 
121/41 ^lll**™ 151/S1 



= 2 2 2 2 s 2 2 2 a 22 s =)» 5222 2 

2 2 2 2 2 2 2 2 =! 2 L s - 25 " s 52 ;SS 

s = 2 2 2 2 2 2 2 2 IlFi - a - = .= - - s 

301/101 t tc ctc acc gtt ggt etc egg 

2 2 ttt 2 2 2 si 2 2 2 22 u. »i - tht «i ,1, u. 
Ui «i «« -<= «s ss 2 2 2 2-1K s 2 2 2 2 2 

cys gly glu gin asp val val leu val ala ^gxyj VV 5cax & N»: 

2 2 2 2 K 2 22 2 2 5,2 2 2 2 22 2 22 
S2 2 22 2 =*2$ 2 £1 s H a - s 52 - = 

541/181 ccc ggg ata gec gt. cgc =»c tec gag cat tgg 

2 til til 2 2 2 2 2 2 2 2^ 2 2 - =« >*• " p 
2 2 2 2 2 2 2 2, 2 2 22 2 2 2 2 2 2 2 2 
i'i 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 
2 2 3 2 2 2 2 2 2 2 22 2 2 2 2 2 2 2 2 
22 2 2 2 2 2 2 2 2 22 2 2 2 £ 2 2 2 2 



871/291 

52 2 2 2 2 2 2 2 2 2 2 2 S 2 2 2 2 2 2 

2ri 2~2 2lS2 22 22 22 2 222 22 2 



so id NPSI-TIM-^M 
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961/321 991/331 

aac teg teg ctg egg cgc cgc egg gat aga ate gee cgc gaa cca gtg gta egg cgc aga 
ser ser ser leu arg arg arg arg asp arg ile ala arg glu pro val val arg arg arg 
1021/341 1051/351 

tta ace teg tat cat ctg agt tag .ttg ccc gcg caa tgg gca tec gcg tgt tat egg tat 
leu thr ser tyr his leu ser) AMB (leu_pro ala gin trp ala ser ala cys tyr arg tyr 
1081/361 ND'.Soa^ 1111/371 

tac gtg aca gtc tgt egg caa* gga ggg acg cat gec act etc cga tea tga gca gcg gat 

SUSi tht val cys arg gln gly gly thr 5ivi/ 3 a i thr leu arg ser)opA(a i«Sm^ 

act tga cca gat cga gag cgc tct eta cgc cga aga tec caa gtt cgc ate gag tgt ccg 
ala^OPA/pro asp arg glu arg ser leu arg arg arg ser gin val arg ile glu cys pro 
1201/401 Vi6&Tb*t**t>4 1231/411 

t cgg „g g c tt ccg cgc acc gac cgc gcg gcg gcg cct gca ggg cgc ggc gtt gtt cat 
tr? arg gly leu pro arg thr asp arg ala ala ala pro ala gly arg gly val val his 
1261/421 1291/431 

cat cqq tct ggg gat gtt ggt ttc egg cgt ggc gtt caa aga gac cat gat egg aag ttt 
his arg ser gly asp val gly phe arg arg gly val gin arg asp his asp arg lys phe 
1321/441 1351/451 

ccc gat act cag cgt ttt egg ttt tgt cgt gat gtt egg tgg tgt ggt gta tgc cat cac 
pro asp thr gin arg phe arg phe cys arg asp val arg trp cys gly val cys his his 
1381/461 1411/471 

egg tec teg gtt gtc egg cag gat gga teg tgg egg ate ggc tgc tgg ggc ttc gcg cca 
arg ser ser val val arg gin asp gly ser trp arg ile gly cys trp gly phe ala pro 

■ • 1441/481 1471/491 . 

gcg teg tac caa ggg ggc egg ggg etc att cac cag ccg tat gga aga tc 
ala ser tyr gin gly gly arg gly leu ile his gin pro tyr gly arg ^ 

SEQ ID N&^jfl (continued 1) 
FIGURE 45ZC (continued 1) 

seq 45ZA joLfxtci dvec+Vy +o phoA 
rfY . Htf.^Vcag tct gtc ggc aag gag gga cgc atg cca etc tec gat cat gag cag egg 

ngln ser val gly lys glu gly arg met pro leu ser asp his glu gin arg 

1141/381 1171/391 

atg ett gac cag ate gag age get etc tac gee gaa gat ccc aag ttc gca teg agt gtc 
^ lit til Lp gin ile glu ser ala leu tyr ala glu asp pro lys phe ala ser ser val 

1201/401 1231/411 

cgt ggc ggg ggc ttc cgc gca ccg acc gcg egg egg cgc ctg cag ggc gcg gcg ttg ttc 
arg gly gly gly phe arg ala pro thr ala arg^arg^arg leu gin gly ala ala leu phe 

at^a^ggt ctg ggg atg ttg gtt tec ggc gtg gcg ttc aaa gag acc atg ate gga agt 
ile ile gly leu gly me? leu val ser gly val ala phe lys glu thr met He gly ser 

tirccg^atTctT^ "gtt" ttc ggt ttt gtc gtg atg ttc ggt ggt gl^tg "OT^cc ate 
phe pro ile leu ser val phe gly phe val v jJ ^ P he gly gly Val Val tyt 

acc'ggt'cct egg ttg tee ggc agg atg gat cgt'ggc'gga teg get get ggg get teg cgc 
thr gly pro arg leu ser gly arg met asp arg gly gly ser ala ala gly ala ser arg 
1441/481 1471/491 

cag cgt cgt acc aag ggg gee ggg ggc tea ttc acc age cgt atg gaa gat c 
gin arg arg thr lys gly ala gly gly ser phe thr ser arg met glu asp) 

seq id ND5: 



FIGURE 45A 
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Sequence Rv2169c predicted by Cole et al., 1998 (Nature 393:537-544) and 
containing Seq4 5A 



i/i 31/11 



^cwtfWvaatg cca etc tec gat cat gag cag egg atg ctt gac cag ate gag age get etc tac gee 

-*2SeI pro leu ser asp his glu gin arg met leu asp gin ile glu ser ala leu tyr ala 
X^**^ 61/21 91/31 

^ gaa gat eee aag tte gca teg agt gte egt ggc ggg ggc ttc cge gca ccg ace geg egg 

alu asp pro lys phe ala ser ser val arg gly gly gly phe arg ala pro thr ala arg 



121/41 151/51 



Lege etg cag ggc geg geg ttg ttc ate ate ggt etg ggg atg ttg gtt tec ggc gtg 
all arg leu gin gly ala ala leu phe ile jl^gly leu gly met leu val ser gly val 

aeg'tte aaa gag ace atg ate gga agt tte eeg ata etc age gtt tte ggt ttt gte gtg 
ala phe lys glu thr met ile gly ser phe pco ±1« leu ser val phe gly phe val val 

III' lie ggt ggt gtg gtg tat gee ate ace ggt cet egg ttg tee ggc agg atg gat. egt 
mil phe gly gly val val tyr ala ile thr gly pro arg leu ser gly arg met asp arg 
301/101 331/111 

„ c aaa tca act get ggg get teg cgc cag egt egt acc aag ggg gee ggg ggc tea ttc 
Tly ITy ser 111 .1. gly ala ser arg gin arg arg thr lys gly ala gly gly ser phe 
9 y y Y 391/131 



361/121 



acc age egt atg gaa gat egg ttc egg cgc cgc ttc gac gaa taa 
thr ser arg met glu asp arg phe arg arg arg phe asp glu,>OCH 

SEQ ID hios: SO SOS 
FIGURE 45D 

ORF according to Cole et al., 1998 (Nature 393:537-544) and containing Rv2169c 

1/1 31/11 
<rf*^rL caa tct ate ggc aag gag gga cgc atg cca etc tec gat cat gag cag egg atg ctt 
^V****Si<gE sir 111 gly lyJ glu Hv «g met pro leu ser asp his glu gin arg met leu 

^ «c cag ate gag age get etc tac gee gaa gat ecc aag ttc gca teg agt gte egt ggc 

as"p g2 tie glu ser ala leu tyr ala glu asp^pro lys phe ala ser ser val arg gly 

qqq'ggc ttc ege gca ccg ace geg egg egg cge etg cag ggc geg geg ttg ttc ate ate 
gly g!y phe arg ala pro thr ala arg arg arg^leu gin gly ala ala leu phe He He 

qgt'ctg ggg atg ttg gtt tee gge gtg geg ttc aaa gag ace atg ate gga agt ttc eeg 
g!y leu gly me? leu val ser gly val ala phe lys glu thr met He gly ser phe pro 



ata'ctc aac gtt ttc ggt ttt gte gtg atg ttc'ggt ggt gtg gtg tat gee ate acc ggt 
III leu ser 111 phe gly phe val val met ph^gly gly val val tyr ala He thr gly 

"eefegg ttg tee ggc agg "atg gat egt ggc gga "teg get get ggg flcTtcg" ege cag egt 
pro III leu ser gly arg met asp arg gly J« ala ala gly ala " E " 9 ^ * 

ItVlll aaa qgg gee ggg ggc tea ttc acc age egt atg gaa gat egg ttc egg cgc cgc 
ll g thr Tyl HI Til HI gly ser phe thr ser arg met glu asp arg phe arg arg arg 
421/141 

ttc gac gag taa 
phe asp glu^OCH 

seq id NDS'.SO^-SI^ 

FIGURE 45B 
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1/1 31/11 
e a*t>tf**H^cag ccg cgc cgc ate gac cag ggc etc acg ccc ggt cac ttc tec gcg ttc etc aac aat 
*£?gln pro arg arg ile asp gin gly leu thr pro gly his phe ser ala phe leu asn asn 
61/21 91/31 

tec aat gaa cat cgc acc agg tta ggc age aat ccc gcg gac ccg cac ccc act cgc cga 
ser alv glu his arg thr arg leu gly ser asn pro ala asp pro his pro thr arg arg 

^ 2 ^ 1 CI /CI 



cca'acc aac tea cag aca ccc tct acg atg cag ggt atg egg acc ccc aga cgc cac tgc 
pro ala asn ser gin thr pro ser thr met gin gly met arg thr pro arg arg his cys 

r LU 3 211/71 

cq^ cac ate gee gtc etc gee gee gtt age ate gec gee act gtc gtt gee ggc tgc teg 
ara arg tie ala val leu ala ala val ser ile ala ala thr val val ala gly cys ser 

y ^ 271/91 
tca'ggc teg aag cea age ggc gga cea ctt ccg gac gcg aag ccg ctg gtc gag gag. gee 
ser gly ser lys pro ser gly gly pro leu pro asp ala lys pro leu val glu glu ala 
301/101 331/111 

acc qcg cag acc aag get etc aag age gcg cac atg gtg ctg acg gtc aac ggc aag ate 
thr ala gll thr lys ala leu lys ser ala his met val leu thr val asn gly lys He) 

seq id nos: Sn-^ix 

FIGURE 46A 



1/1 31/11 



til lit IT, III III III lit HI 111 III III £ III III III III HI III 111 

m^tsUti*^ 91/31 

lla ata aae ate gca cea ggt tag gca gca ate ccg egg ace ege ace eca etc gee gae 

pro 111 «n t£ ala pro gly^ala ala ile pro arg thr arg thr pro leu ala asp 

121/41 SEC&WM 151/51 „„_ ^ n ,r- acc 



egg eca act cac aga cac cct eta cga tgc agg gta tgc gga ccc cea gac gee act gec 
arg pro thr his arg his pro leu arg cys arg val cys gly pro pro asp ala thr ala 

211/71 

qtc'gca teg ccg tee teg ccg ccg tta gca teg ccg eca ctg teg ttg ccg get get cgt 
val ala ser pro ser ser pro pro leu ala ser pro pro leu ser leu pro ala ala arg 

^ 271/91 
III' 11* caa age caa gcg gcg gac cac ttc egg acg cga age cgc tgg teg agg agg cea 
III 111 arg sir gtn 111 111 ls P his phe arg^thr arg ser arg trp ser arg arg pro 

cca'cgc aga eca agg etc tea aga gcg cgc aca tgg tgc tga egg tea acg gca aga tc 
™rg arg pro^g leu ser arg ala arg B« trp ^ 



SEQ ID KOS/UtS-^fk 
FIGURE 46B 



155/185 



1/1 31/11 



^tfW^? ' cat cga cca ggg c ct cac gec egg tea ctt etc cgc gtt cct caa caa ttc 

^* a^a pro his arg pro gly pro his ala arg ser leu leu arg val pro gin gin phe 

:«S*>* 61/21 91/31 

egg tga aca teg cac cag gtt agg cag caa tee cgc gga cec gca cee cac teg ccg acc 
ar5j0PA(thr ser his gin val arg gin gin ser arg gly pro ala pro his ser pro thr 
121/41 seaS&W'-Stt 151/51 

accaa etc aca gac acc etc tac gat gca ggg tat gcg gac ccc cag acg cca ctg ccg 
g?y gin leu thr asp thr leu tyr asp ala ala asp pro gin thr pro leu pro 

tc^ cat cgc cgt cct cgc cgc cgt tag cat cgc cgc cac tgt cgt tgc egg ctg etc gtc 
ser his arg arg pro arg arg .<$AMB(hi. arg arg his cys arg eys arg leu leu val 
241/81 SeQXfcNP'.SaO- 5 * 271/91 

'tc gaa gec aag egg egg acc act tec gga cgc gaa gec get ggt cga gga ggc cac 
all tin glu ala lys arg arg thr thr ser gly arg glu ala ala gly arg gly gly his 
y y * 331/111 



cgc'gca gac caa ggc tct eaa gag cgc gca eat ggt get gac ggt caa egg caa gat c 
arg ala asp gin gly ser gin glu arg ala his gly ala asp gly gin arg gin asp) 



SEQ ID t\05'.%il-^O 
FIGURE 46C 



156/185 



Coding sequence Rvl411c predicted by Cole et al., 1998 (Nature 393: 537-544) and 
containing seq4 6A: 



i/i 31/11 



^•Sb***^' egg acc ccc aga cgc cac tgc cgt cgc ate gec gtc etc gee gee gtt age ate gee 
^^Mel arg thr pro arg arg his eys arg arg ile ala val leu ala ala val ser He ala 
j^^t>*^* . 91/31 

• acc act gtc gtt gec ggc tgc teg teg ggc teg aag cca age ggc gga eca ctt ccg gac 
Tit thr val val La gly eys ser ser gly ser^lys pro ser gly gly pro leu pro asp 

acg'aag ccg ctg gtc gag gag gee ace gcg cag acc aag get etc aag age gcg cac atg 
a^a lys pro leu val glu glu ala thr ala gin thr lys ala leu lys ser ala his met 

2 r 211/71 
qt^ctg acg gtc aac ggc aag ate ccg gga ctg tct ctg aag acg ctg age ggc gat etc 
S leu thr val asn gly lys ile pro gly leu ser leu lys thr leu ser gly asp leu 

27 1/ 91 

ac^ acc aac ccc acc gec gcg acg gga aac gtc aag etc acg ctg ggt ggg tct gat ate 
ITr thr asn pro thr ala ala thr gly asn val lys leu thr leu gly gly ser asp He 
, nl/101 331/111 

oat gec gac ttc gtg gtg ttc gac ggg ate ctg tac gec acc ctg acg ccc aac. cag tgg 
asp Tit asp phe Vl val phe asp gly ile leu^tyr ala thr leu thr pro asn gin trp 

age'gat ttc ggt ccc gec gee gac ate tac gac ccc gee cag gtg ctg aat ccg gat ace 
ier asp phe gly pro ala ala asp ile tyr asp prb ala gin val leu asn pro- asp thr 

aac'cU gec aac gtg ctg gcg aat ttc gee gac gca aaa gec gaa ggg egg gat acc ate 
g!y ill Tit asn va! leu 111 asn phe ala asp^ala lys ala glu gly arg asp thr He 

aac'ggc cag aac ace ate cgc ate age ggg aag gta teg gca cag geg gtg aac cag ata 
1st gly gin asn thr ile arg ile ser gly lys^val ser ala gin ala val asn gin xle 

geg 7 ccg ccg ttc aac gcg acg cag eeg gtg ccg geg acc gtc tgg att cag gag acc ggc 
III pro pro phe asn ala thr gin pro val pro ala thr val trp xle gin glu thr gly 

/-Ai /oni 631/211 

gat cat eaa ctg gca cag gee cag ttg gae cgc ggc teg ggc aat tec gtc cag atg acc 
asp his gin leu ala gin ala gin leu asp arg gly ser gly asn ser val gin met thr 

661/221 691/231 

ttg teg aaa tgg ggc gag aag gtc cag gtc acg aag ccc ccg gtg age tga 
leu ser lys trp gly glu lys val gin val thr lys pro pro val ser)OPA 



seq id N.os'.'sai-^a^i 

FIGURE 46D 



157/185 



ORF according to Cole et al . , 1998 (Nature 393: 537-544): 
and containing the coding sequence Rvl411c: 

1/1 31/11 



__ tf «»3v*tag etc acc cag gtt gga ccg gtt cag tgt etc ggc cat cac gtc ggc ggt gaa ttg gec 
_^>^ 0 ' AMB Cleu thr gin val gly pro val gin cys leu gly his his val gly gly glu leu ala 

*^ 61 /£l>5 < rfi>xt>N6iSX«-l 91/31 

qtc ggg caa tac ate gac gac cgt cag aca cac gec gtt gac age gat cga gtc gee gtg 
val alV gin tyr ile asp asp arg gin thr his ala val asp ser asp arg val ala val 
121/41 151/51 

acc ggc gtc ggc ggt aac cat egg acc gcg gat ggt cag ccg cgc cgc ate gac cag ggc 
ala gly val gly gly asn his arg thr ala asp gly gin pro arg arg ile asp gin gly 
181/61 211/71 

etc acg ccc ggt cac ttc tec gcg ttc etc aac aat tec ggt gaa cat cgc acc agg tta 
leu thr pro gly his phe ser ala phe leu asn asn ser gly glu his arg thr arg leu 
241/81 271/91 

qqc age aat ccc gcg gac ccg cac ccc act cgc cga ccg gee aac tea cag aca ccc tct 
alv ser asn pro ala asp pro his pro thr arg arg pro ala asn ser gin thr pro ser 

331/111 

acg a?g cag ggt atg egg acc ccc aga cgc cac tgc cgt cgc ate gee gtc etc gee gec 
thr met gin gly met arg thr pro arg arg his cys arg arg ile ala val leu ala ala 
361/121 391/131 

qtt age ate gec gec act gtc gtt gec ggc tgc teg teg ggc teg aag cca age. ggc gga 
val ser ile ala ala thr val val ala gly cys ser ser gly ser lys pro ser gly gly 
421/141 451/151 
■ cca ctt ccg gac gcg aag ccg. ctg, gtc gag gag gee acc gcg cag acc aag get etc aag 

pro leu pro Lp ala lys pro leu val glu glu ala thr ala gin thr lys ala leu lys 

511/171 

aac'qcg cac atg gtg ctg acg gtc aac ggc aag ate ccg gga ctg tct ctg aag acg ctg 
HI his me? vai leu thr val asn gly lys ile pro gly leu ser leu lys thr leu 

571/191 

aqc'qqc gat etc acc acc aac ccc acc gee gcg acg gga aac gtc aag etc acg ctg ggt 
sir q!y asp leu thr thr asn pro thr ala ala thr gly asn val lys leu thr leu gly 

631/211 

ggg'tct gat ate gat gec gac ttc gtg gtg ttc gac ggg ate ctg tac gec acc ctg acg 
all ser Lp ile Lp ala asp phe val val phe asp gly ile leu tyr ala thr leu thr 
661/221 691/231 

ccc aac cag tgg age gat ttc ggt ccc gec gec gac ate tac gac ccc gec cag gtg ctg 
pro asn gin trp sir Lp phe gly pro ala ala asp ile tyr asp pro ala gin val leu 

751/251 

lat'ccg gat acc ggc ctg gec aac gtg ctg gcg aat ttc gec gac gca aaa gec .gaa ggg 
asn pro Lp thr gly leu ala asn val leu ala asn phe ala asp ala lys ala glu gly 

f c 811/271 

egg'gat acc ate aae ggc cag aac acc ate cgc ate age ggg aag gta teg gca cag gcg 
arq Lp thr ile asn gly gin asn thr ile arg ile ser gly lys val ser ala gin ala 
841/281 871/291 

ato aac cag ata gcg ccg ccg ttc aac gcg acg cag ccg gtg ccg gcg acc gtc tgg att 
ill asn gtn ile all pro pro phe asn ala thr gin pro val pro ala thr val trp He 



om/301 931/311 
~ca^ gag acc ggc gat eat caa- ctg gca cag gee cag ttg gac cgc .ggc tcg_ggc.aat .tcp 
all glu thr g_y Lp his gin leu ala gin f^f^ leu as P ar * ^ Ser gly aan 

ate' cag atg acc ttg teg aaa tgg ggc gag aag gtc cag gtc acg aag ccc ccg gtg age 
ill g_n me_ thr leu ser lys trp gly glu lys val gin val thr lys pro pro val ser) 



1021/341 
tga 
OPA 



seq id nos: $33-324 



FIGURE 46B 



158/185 



1/1 31/11 



.~4ZS~aaa ctg gtc aac ggc gcc ggc ate gac gac gec gec gtc gtg acc tgc egg ccg gac age 
SBSWff-**^!; HI l al asn lly aia gly ile asp asp ala ala val val thr eys arg pro asp ser 

61/21 9l/31 

ct gcc gat g cc cag cag atg gtc gag gcg gca ctg ggc cga tat ggc cgt ttg gac gga 
ill ala asp ala gin gin met val glu ala ala leu gly arg tyr gly arg leu asp gly 
121/41 151/51 
' qtq ttg gtg gcc teg ggc age aac cat gtg gcg ccc att acc gag atg gcc gtc gag gac 
2 leu 111 ala ser gly ser asn his val ala pro ile thr glu met ala val glu asp 
181/61 211/71 

ttc gac get gtg atg gac gcg aac gtg egg ggt gcc tgg ctg gtg tgt egg gcg gcc gga 
one asp ala val met asp ala asn val arg gly ala trp leu val cys arg ala ala gly 

271/91 

cgg'qtg ctg etc gag cag ggt cag ggc ggc age gtg gtg ctg gtg teg tec gtt ege ggc 
III val leu leu glu gin gly gin gly gly ser val val leu val ser ser val arg gly 

331/111 

qqg'ttg ggc aaf gcc gcc ggt tac age gcg tac tgc ccg teg aag gcg ggc acc gat c 
gly ill gly asn ala ala gly tyr ser ala tyr cys pro ser lys ala gly thr asp) 

seq id tios: ^05-^^ 

FIGURE 47A 



1/1 31/11 
^ftVlv-aac tag tea acg gcg ccg gca teg acg acg ccg ccg tcqtga cct gcc ggc egg aca gcc 

™ th - * la pto ila " r thE thr izjr 
m ;n lit m i% m ss m in sys s: :n s s £ s s ;x 

tat' tag tgg cct egg gca gca acc atg tgg cgc cca tta ccg aga tgg ccg teg agg act 
cys trp trp pro arg ala ala thr met trp arg^pro leu pro arg trp pro ser arg thr 

tcq'a'cg ctg tga tgg acg cga acg tgc ggg gtg cct ggc tgg tgt gtc ggg egg ccg gae 
sir tlr ^eu)0?A(trp thr arg thr eys gly val pro gly trp eys val gly arg pro asp 
241/81 ^V$tfaX&MD-.S»© 271/91 

tgc tgc teg age agg gtc agg gcg gca gcg tgg tgc tgg tgt cgt ccg ttc gcg gcg 
111 lis eys ser ser arg val arg ala ala ala trp cys trp cys arg pro phe ala ala 

331/111 

ggt 7 tgg gca atg ccg ccg gtt aca gcg cgt act gcc cgt cga agg egg gca ccg ate 



SEQ ID K05.'^^7-^ D 
FIGURE 47B 



159/185 



i/i 31/11 
.4RW-iet oat caa egg cgc egg cat cga <:ga cgc cgc cgt cgt gac ctg ccg gec gga cag cct 
J.^-^Si g5 Tin arg arg arg his arg arg arg arg.g arg asp leu pro ala gly gin pro 

^^'^ !i c 2 caa tgc cca gca gat ggt cga ggc ggc act ggg ccg ata tgg ccg ttt gga egg agt 
gly arg cys pro ala asp gly arg gly gly thr gly pro ile trp pro phe gly arg ser 

III' lit ggc etc ggg cag caa cca tgt ggc gee eat tae cga gat ggc cgt cga gga ctt 
111 gly glY !•« gly gin gin pro cys gly ala^his tyr arg asp gly arg arg gly leu 

tat aat gga cgc gaa cgt gcg ggg tgc ctg get ggt gtg teg ggc ggc egg acg 
arg arg cys Ts P gly arg glu arg ala gly cys leu ala gly val ser gly gly arg thr 



241/81 271/91 



III lit act cga gca ggg tea ggg egg cag cgt ggt get ggt gtc gtc cgt teg egg egg 
g?y III Hi arg ala gTy ser gly arg gin arg^gly ala gly val val arg ser arg arg 

lll'lll C aa tgc cgc egg tta cag cgc gta ctg ccc gtc gaa ggc ggg cac cga tc 
Hi gly gin eys arg arg leu gin arg val leu pro val glu gly gly hxs arg) 



SEQ ID KDS' ^-^^ ' 
FIGURE 47C 



160/185 



Coding sequence Rvl714 predicted by Cole et al . , 1998 (Nature 393: 537-544) and 
containing seq 47A: 



1/1 31/11 
, .«*»^*gtg gag gaa atg gcg ctg get cag cag gtg ccg aac ctg ggt ctg gcg cgc ttc age gtg 

*** ^va? glu glu me? Ill leu ala gin gin val P^ asn leu leu ala arg phe Ser Val 

, cag 2 gac aag teg ate ctg ate ace ggc gcg acc ggt teg ttg ggc cga gtt gec gee egg 
gin asp lys ser ile leu ile thr gly ala thr gly ser leu gly arg val ala ala arg 

121/41 151/51 

gcg ctg gec gac gcg gga gcg egg ctg aca ctg gee ggc ggc aae teg gee ggt ctg gee 
ala leu ala asp ala gly ala arg leu thr 1~ 9 lv glv asn ser ala gly leU al * 

tlYllg gtc aac ggc gec ggc ate gac gac gee gec gtc gtg acc tgc egg ccg gac age 
Til til val asn gly ala gly ile asp asp ala ala val val thr cys arg pro asp ser 



271/91 

ci^gee gat gee cag cag atg gtc gag gcg gca ctg ggc cga tat ggc cgt ttg gae gga 
leu ala asp ala gin gin met val glu ala ala leu gly arg tyr gly arg leu asp gly 

-^n i/ioi 331/111 

at a ttg gtg gec teg ggc age aac cat gtg gcg' ccc att acc gag atg gee gtc. gag gae 
2 ill 111 ala ser giy ser asn his val ala pro ile thr glu met ala val glu asp 

391/131 

ttc'gac get gtg atg gac geg aac gtg egg ggt gec tgg ctg gtg tgt egg gcg gee gga 
phe aspaia val me? asp ala asn val arg gly^ala trp leu val cys arg ala ala. gly 

ca^Qtg ctg etc gag cag ggt cag ggc ggc age gtg gtg ctg gtg teg tec gtt cgc ggc 
III S leu leu gll gin gly gin gly gly serial val leu val ser ser val arg gly 

ggg'ttg ggc aat gec gec ggt tac age geg tac tgc ccg teg aag gcg ggc ace gat ctg 
giy leu gly asn ala ala gly tyr ser ala tyr^cys pro ser lys ala gly thr asp leu 

ItVlll aag aca ttg gcg gec gaa tgg ggc ggt eae ggc att egg gtg aac gcg ctg geg 
ill ala lys thr leu ala ala glu trp gly gly his gly He arg val asn ala leu ala 
601/201 631/211 

ccg acg gtg ttt egg tee gcg gtg ace gag tgg atg ttc acc gac gat ccg aag ggc egg 
pro thr val phe arg ser ala val thr glu trp met phe thr asp asp pro lys gly arg 



"fifil/221 691/231 
ace acc egg gag gcg atg etc gec egg ate ccg ttg cgc cgc ttc gee gaa ccg gaa gac 
Til thr arg glu 111 me? leu ala arg ile pro leu arg arg phe ala glu pro glu asp 

ctc'gJc ggc gec ctg ate tat ctg etc age gac gee teg age ttc tac acc ggc cag gtg 
pie Hi gly ala leu ile tyr leu leu ser asp ala ser ser phe tyr thr gly gin val 
781/261 811/271 
atg tat ctg gac ggc ggg tac acc gca tgctga 
met tyr leu asp gly gly tyr thr ala cysjOPA 



seq id tios:%i?>- < ^ l \ 

FIGURE 47D 



161/185 



ORF according to Cole et al., 1998 (Nature 393: 537-544) and containing the 
coding sequence Rvl714: 

24/1 54/11 
seo tea gtg gag gaa atg gcg ctg get cag cag gtg ccg aac ctg ggt ctg gcg cgc ttc age 

AMB (val glu glu met ala leu ala gin gin val pro asn leu gly leu ala arg phe ser 
>C6i ir> to *3to-Q4/21 114/31 

gtg cag gac aag teg ate ctg ate ace ggc gcg ace ggt teg ttg ggc cga gtt gee gee 

val gin asp lys ser ile leu ile thr gly ala thr gly ser leu gly arg val ala ala 

144/41 174/51 

egg gcg ctg gee gac gcg gga gcg egg ctg aca ctg gee ggc ggc aac teg gee ggt ctg 
arg ala leu ala asp ala gly ala arg leu thr leu ala gly gly asn ser ala gly leu 
204/61 234/71 

gee gag ctg gtc aac ggc gee ggc ate gac gac gee gee gtc gtg ace tgc egg ccg gac 
ala glu leu val asn gly ala gly ile asp asp ala ala val val thr cys arg pro asp 
264/81 294/91 

age ctg gee gat gec cag cag atg gtc gag gcg gca ctg ggc cga tat ggc cgt ttg gac 
ser leu ala asp ala gin gin met val glu ala ala leu gly arg tyr gly arg leu asp 
324/101 354/111 

gga gtg ttg gtg gec teg ggc age aac cat gtg gcg ccc att ace gag atg gee gtc gag 
gly val leu val ala ser gly ser asn his val ala pro ile thr glu met ala val glu 
384/121 414/131 

gac ttc gac get gtg atg gac gcg aac gtg egg ggt gee tgg ctg gtg tgt egg gcg gee 
' ' asp phe asp ala val met asp ala 1 asn val arg gly ala trp leu val . cys arg ala ala 

444/141 474/151 

gga egg gtg ctg etc gag cag ggt cag ggc ggc age gtg gtg ctg gtg teg tec gtt cgc 
gly arg val leu leu glu gin gly gin gly gly ser val val leu val ser ser val arg 
504/161 534/171 

ggc ggg ttg ggc aat gee gec ggt tac age gcg tac tgc ccg teg aag gcg ggc acc gat 
gly gly leu gly asn ala ala gly tyr ser ala tyr cys pro ser lys ala gly thr asp 
564/181 594/191 

ctg ttg gec aag aca ttg gcg gee gaa tgg ggc ggt cac ggc att egg gtg aac gcg ctg 
leu leu ala lys thr leu ala ala glu trp gly gly his gly ile arg val asn ala leu 
624/201 654/211 

gcg ccg acg gtg ttt egg tec gcg gtg acc gag tgg atg ttc acc gac gat ccg aag ggc 
ala pro thr val phe arg ser ala val thr glu trp met phe thr asp asp pro lys gly 
684/221 714/231 

egg gee acc egg gag gcg atg etc gee egg ate ccg ttg cgc cgc ttc gec gaa ccg gaa 
arg ala thr arg glu ala met leu ala arg ile pro leu arg arg phe ala glu pro glu 
744/241 774/251 

gac ttc gtc ggc gec ctg ate tat ctg etc age gac gee teg age ttc tac acc ggc cag 
asp phe val gly ala leu ile tyr leu leu ser asp ala ser ser phe tyr thr gly gin 
804/261 834/271 
gtg atg tat ctg gac ggc ggg tac acc gca tgc tga 
val met tyr leu asp gly gly tyr thr ala cys) OPA 



SEQ ID 

FIGURE 47F 



162/185 



1/1 31/11 



,o«*>-837 agg etc atg age aag acg gtt etc ate ctt ggc gcg ggt gtc ggc ggc ctg ace acc gee 
S£a> iD^-SSSarg leu met ser lys thr val leu ile leu gly ala gly val gly gly leu thr thr ala 

61/21 91/31 
gac acc etc cgt caa ctg eta cca cct gag gat c 
asp thr leu arg gin leu leu pro pro glu asp) 



SEQ ID A«5 S37-S38 
FIGURE 48A 



1/1 31/11 
S^u^.a* ggc tea tga gca aga egg ttc tea tec ttg gcg egg gtg teg gcg gee tga cca ccg ccg 
olv ser OPAfala arg arg phe ser ser leu ala arg val ser ala ala)oPAfpro pro pro 
z.K— 1 _ 91/31 se ® £> Ko-SW 



61/21 Sc(sj it>vU>.*MC> 

aca ccc tec gtc aac tgc tac cac ctg agg ate 
thr pro ser val asn cys tyr his 'leu arg ilej 



SEQ ID Kcfc,.«3?-SW 
FIGURE 48B 



1/1 31/11 
SOS> 4>«».«H£ get cat gag caa gac ggt tct cat cct tgg cgc ggg tgt egg egg cct gac cac cgc cga 
^..teJla his glu gin asp gly ser his pro trp arg gly cys arg arg pro asp hxs arg arg 



61/21 

cac cct ccg tea act get acc acc tga gga tc 
his pro pro ser thr ala thr thr)OPA gly 



SEQ ID Wx&.S«2-S<0 
FIGURE 48C 



163/185 



Coding sequence Rv0331 predicted by Cole et al., 1998 (Nature 393: 537-544). and 
containing seq48A: 

1/1 31/11 
,t>«». W atg age aag acg gtt etc ate ctt gge geg ggt gtc ggc ggc ctg ace acc gee gae acc 
- ^ k »*s6let ser lys thr val leu ile leu gly ala gly val gly gly leu thr thr ala asp thr 

etc cat caa ctg eta cca cct gag gat cga ate ata ttg gtg gae agg age ttt gae ggg 
leu arg gin leu leu pro pro glu asp arg ile ile leu val asp arg ser phe asp gly 

121/41 151/51 

acg ctg ggc ttg teg ttg eta tgg gtg ttg egg ggc tgg egg egg cct gae gae gtc cge 
thr leu gly leu ser leu leu trp val leu arg^gly trp arg arg pro asp asp val arg 

ate'ege ccc acc gcg gcg teg ctg ccc ggt gtg gaa atg gtt act gca acc gtc gee cac 
111 arg pro thr 111 111 ser leu pro gly val^lu met val thr ala thr val ala his 

lll'llc ate gcg gee cag gta gtg cac acc gae aac age gtc ate ggc tat gae gcg ttg 
ill lit ile ala ala gin val val his thr asp asn ser val ile gly tyr asp ala leu 
301/10? 331/111 



ota ate gca tta ggt gcg gcg ctg aac acc gae gec gtt ccc gga ctg teg gae gcg etc 
111 lit III leu gly 111 111 leu asn thr asp ala val pro gly leu ser asp ala leu 
361/121 391/131 



^fil/121 j^x/ xjx 

aac acc gae gtc gcg ggc cag ttc tac acc ctg gae ggc gcg get gag ctg cgt gcg aag 
asp lit Hp val 111 gly gin P he tyr thr leu^asp gly ala ala glu leu arg ala lys 

otc'aaq gcg etc gag cat ggc egg ate get gtg get ate gec ggg gtg ccg ttc aaa tgc 
val gll 111 leu glu his gly arg ile ala val ala ile ala gly val pro phe lys cys 
dfil/161 » 1 511/171 ..... 

cca acc gca ccg ttc gaa gcg gcg ttt ctg ate gec gee caa etc ggt gae cgc tac gec 
pro 111 ala pro phe glu ala ala phe leu ile ala ala gin leu gly asp arg tyr ala 

acc'gga acc gta cag ate gae acg ttc acg cct gae ccg ctg ccg atg ccc gtt gca ggt 
thr 111 thr val gin ile lap thr phe thr pro asp pro leu pro met pro val ala gly 
601/201 631/211 

ccc gag gtc ggc gag get ttg gtc teg atg etc aag gat cac ggt gtc ggc ttc cat cct 
pro glu val gly glu ala leu val ser met leu lys asp his gly val gly phe his pro 

fifil/221 691/231 

cgc aag gee eta get cgc gtc gat gag gec gca agg acg atg cac ttc ggt gae gge acg 
arg lys ala leu ala arg val asp glu ala ala arg thr met his phe gly asp gly thr 

721/241 751/251 

tec gaa ccg ttc gat ctg ctt gee gtg gtc ccc ccg cac gtg ccc tec gec geg gcg egg 
ser glu pro phe asp leu leu ala val val P™ P» his val P ro ser ala ala ala arg 

lll'ltl ggt etc age gaa tec ggg tgg ata ccc gtg gae ccg cgc acc ctg tec act age 
ser Til gly leu ser glu ser gly trp ile pro val asp pro arg thr leu ser thr ser 

ft41/281 871/291 

gee gae aac gtg tgg gee ate ggc gat gcg acc gtg ctg acg ctg ccg aat ggc aaa ccg 
Tit asp asn val trp ala ile gly asp ala thr val leu thr leu pro asn gly lys pro 

cini/101 931/311 

ctg ccc aag get gec gtg ttc gec gaa gee cag gec gca gtt gtc gee cac ggc gtc gee 
leu pro ly! La ala vai phe ala glu ala gin ala ala val val ala his gly val ala 

961/321 991/331 

cac cat etc ggt tac gae gta get gag cgc cac ttc acc ggc acg ggc gee tgc tac gtc 
_2g_SS leu Sy tyr asp val lla glu arg his phe thr gly thr gly ala cys tyr val 

ir»2i/341 1051/351 

gag acc ggt gat cac cag gca gec aag ggc gae ggc gat ttc ttc get ccg teg gcg ccc 
glu thr gly asp his gin ala ala lys gly asp gly asp phe phe ala pro ser ala pro 
1081/361 1111/371 



teggtg acg ctg tac ccg ccg teg egg gag ttt cac gag gag aag gtc gca caa gaa ctg 
ser val thr leu tyr pro pro ser arg glu phe his glu glu lys val ala gin glu leu 



1141/381 

gec tgg ctg acc cgc tgg aag acg tga 
ala trp leu thr arg trp lys thrJOPA 



SEQ ID *V5 



FIGURE 48D 
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ORF according to Cole et al., 1998 (Nature 393: 537-544) and containing coding 
sequence Rv0331: 



1/1 



31/11 

iD %<t(o tga aca ccc gcg ccg acg egg cga caa teg egg aaa acc ggt ccg egg gaa tgc tgc ggg 
OPAjfchr pro ala pro thr arg arg gin ser arg lys thr gly pro arg glu cys cys gly 
3e£>fi5«o.y<»7 — I^Tfi 91/31 

. cca tgg gcc gat aat agt ttg act gac teg gtc agt cae ccc aag acc ttg cgc aag act 
pro trp ala asp asn ser leu thr asp ser val ser his pro lys thr leu arg lys thr 
121/41 151/51 

gcg gcg gaa tct aat att cca aag ata tat gga act cga tgc gaa gga ate agg etc atg 
ala ala glu ser asn ile pro lys ile tyr gly thr arg cys glu gly ile arg leu met 
181/61 211/71 

age aag acg gtt etc ate ctt ggc gcg ggt gtc ggc ggc ctg acc acc gcc gac acc etc 
ser lys thr val leu ile leu gly ala gly val gl* gly leu thr thr ala asp thr leu 
241/81 271/91 

cgt caa ctg eta cca cct gag gat cga ate ata ttg gtg gac agg age ttt gac ggg acg 
arg gin leu leu pro pro glu asp arg ile ile leu val asp arg ser phe asp gly thr 
301/101 331/111 

ctg ggc ttg teg ttg eta tgg gtg ttg egg ggc tgg egg egg cct gac gac gtc cgc gtc 
leu gly leu ser leu leu trp val leu arg gly trp arg arg pro asp asp val arg val . 
361/121 391/131 

cgc ccc acc gcg gcg teg ctg ccc ggt gtg gaa atg gtt act gca acc gtc gcc cae att 
arq ■pro thr ala ala ser leu pro gly val glu met val thr ala thr val ala his ile 
421/141 451/151 ^ 

gac ate gcg gcc cag gta gtg cac acc gac aac age gtc ate ggc tat gac gcg ttg gtg 
asp ile ala ala gin val val his thr asp asn ser val ile gly tyr asp ala leu val 
481/161 511/171 

ate gca tta ggt gcg gcg ctg aac acc gac gcc gtt ccc gga ctg teg gac gcg etc gac 
ile ala leu gly ala ala leu asn thr asp ala val pro gly leu ser asp ala leu asp 
541/181 571/191 

qcc gac gtc gcg ggc cag ttc tac acc ctg gac ggc gcg get gag ctg cgt gcg aag gtc 
ala asp val ala gly gin phe tyr thr leu asp gly ala ala glu leu arg ala lys val 
601/201 631/211 

qaq gcg etc gag cat ggc egg ate get gtg get ate gcc ggg gtg ccg ttc aaa tgc cca 
glu ala leu glu his gly arg ile ala val ala ile ala gly val pro phe lys cys pro 
661/221 691/231 

gcc gca ccg ttc gaa gcg gcg ttt ctg ate gcc gcc caa etc ggt gac cgc tac gcc acc 
ala ala pro phe glu ala ala phe leu ile ala ala gin leu gly asp arg tyr ala thr 

*7 241 751/251 

gga acc gta cag ate gac acg ttc acg cct gac ccg ctg ccg atg ccc gtt gca ggt ccc 
alv thr val gin ile asp thr phe thr pro asp pro leu pro met pro val ala gly pro 

811/271 

ga^tc ggc gag get ttg gtc teg atg etc aag gat cac ggt gtc ggc ttc cat cct cgc 
glu val gly glu ala leu val ser met leu lys asp his gly val gly phe his pro arg 
841/281 871/291 

„„aag-gcc cta-gct^cg,c_gjtc_gat„gajg_gc^„gca_ag,g acg atg cac ttc g gt gac ggc acg tec 

lys ala leu ala arg val asp glu ala ala arg thr met his phe gly asp gly thr ser 



SEQ ID KCS,2Vfc-^<<7- 
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901/301 

gaa ccg ttc gat ctg 
glu pro phe asp leu 
961/321 

gcg ggt etc age gaa 
ala gly leu ser glu 
1021/341 

gac aac gtg tgg gcc 
asp asn val trp ala 
1081/361 

ccc aag get gcc gtg 
pro lys ala ala val 
1141/381 

cat etc ggt tac gac 
his leu gly tyr asp 
1201/401 

acc ggt gat cac cag 
thr gly asp his gin 
1261/421 

gtg acg ctg tac ccg 
val thr leu tyr pro 
1321/441 

tgg ctg acc cgc tgg 
trp leu thr arg trp 



ctt gcc gtg gtc ccc 
leu ala val val pro 

tec ggg tgg ata ccc 
ser gly trp ile pro 

ate ggc gat gcg acc 
ile gly asp ala thr 

ttc gcc gaa gcc cag 
phe ala glu ala gin 

gta get gag cgc cac 
val ala glu arg his 

gca gcc aag ggc gac 
ala ala lys gly asp 

ccg teg egg gag ttt 
pro ser arg glu phe 

aag acg tga 
lys thrjopX 



931/311 

ccg cac gtg ccc tec 
pro his val pro ser 
991/331 

gtg gac ccg cgc acc 
val asp pro arg thr 
1051/351 

gtg ctg acg ctg ccg 
val leu thr leu pro 
1111/371 

gcc gca gtt gtc gcc 
ala ala val val ala 
1171/391 

ttc acc ggc acg ggc 
phe thr gly thr gly 
1231/411 

ggc gat ttc ttc get 
gly asp phe phe ala 
1291/431 

cac gag gag aag gtc 
his glu glu lys val 



gcc gcg gcg egg tea 
ala ala ala arg ser 

ctg tec act age gcc 
leu ser thr ser ala 

aat ggc aaa ccg ctg 
asn gly lys pro leu 

cac ggc gtc gcc cgc 
his gly val ala arg 

gcc tgc tac gtc gag 
ala cys tyr val glu 

ccg teg gcg ccc teg 
pro ser ala pro ser 

gca caa gaa ctg gcc 
ala gin glu I'eu ala 



SEQ ID K£s.^(^-Wj( continued) 
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Fragment amplified by PCR based on the sequence similarities with a serine 
protease of the E.coli htrA family (creation of the BamHI site at the 5' end and 
of the SnaBI site at the 3' end) and subcloned into the vector pJVED: 

1/1 31/11 
£S? i^*^ *«S cca tct aca ccg etc aac age egg gee aga cgc tgc egg teg gtg ctg ccg aga agg egg 

Sdfibifc.^.Vq 0> ro ser thr pro leU aSn Ser arg 3la arg ar9 ° yS a v 

61/21 91/31 
• tga tec gtg gcg agt tgt tea tgt cgc ggc gca cca ccg ccg acc aac ggg tgc ttg cca 

OPA/ser val ala ser cys ser cys arg gly ala pro pro pro thr asn gly cys leu pro 
Se«J it>i\^S60^|^ 41 151/51 

tec gtc tga cca acg gta gtt cgc tgc tga tct cca aaa gtc tea age cca ccg aag cag 

ser vali OPA (pro thr val val arg cys) OPA feer pro lys val ser ser pro pro lys gin 

181/61 ' se<^ »0»U>. SSI SSSCKO.S62 2 11/71 

tea tga aca age tgc gtt ggg tgc tat tga teg tgg gtg gga teg ggg tgg egg teg ccg 
ser)0PA(thr ser cys val gly cys tyrjOPAfser trp val gly ser gly trp arg ser pro 

241/81 l «see>^«*.*63 271/91 ,C> ** 

egg tgg ccg ggg gga tgg tea ccc ggg ccg ggc tga ggc egg tgg gee gec tea ccg aag 
arg trp pro gly gly trp ser pro gly pro gly) OPA (gly arg trp ala ala ser pro lys 
301/101 331/111 5€.S> iv^lAO-SfeS 

egg ccg age ggg tgg cgc gaa ccg acg acc tgc ggc cca tec ccg tct teg gca gcg acg 
arg pro ser gly trp arg glu pro thr thr cys gly pro ser pro ser ser ala ala thr 

391/131 

III' III cca ggc tga cag agg cat tea att taa tgc tgc ggg cgc tgg ccg agt cac ggg 
" asn trp pro gly)6PA(gln arg his ser ileJOCHfcys cys gly arg trp pro ser' hxs gly 

aac ggc agg eaa gg^t^t^ecg acg ccg gac atg aat tgc gta ecc cgc taa cgt cgc 
asn gly arg gin gly trp leu pro thr pro asp met asn cys val pro arg)OCHfarg ar g 
481/161 511/171 t5tffi> 858 

tgc gca cca atg teg aac tct tga tgg cct cga tgg ccc egg ggg etc cgc ggc tac cca 
cvs ala pro met ser asn serWftrp pro arg trp pro arg gly leu arg gly tyr pro 

tW) 571/191 

III' III aga tgg teg acc tgc gtf cc*?' atg tgc tgg etc aaa teg agg aat tgt cca cac 
ser arg arg trp ser thr cys val pro met cys trp leu lys ser arg asn cys pro hxs 

631/2 1 1 

til tag gcg att tgg tgg acc tgt ccc gag gcg acg ccg gag aag tgg tgc acg age egg 
trp>MB(ala ile trp trp thr cys pro glu ala thr pro glu lys trp cys thr ser arg 

teg'aea^g^trac^eg teg acc gca gee tgg'agc ggg tea ggc ggc ggc gca acg ata 
ser thr trp leu thr ser ser thr ala ala trp ser gly ser gly gly gly ala thr lie 
721/241 751/251 

tec ttt teg acg teg agg tga ttg ggt ggc agg ttt atg gcg ata ccg ctg gat tgt cgc 
ser phe ser thr ser arg)oPA(leu gly gly arg phe met ala ile pro leu asp cys arg 
7 81/261 ^^BSl ifc* Ho* ^S^p \ 811/271 

gga tgg cgc tta acc tga tgg aca acg ccg cga agt gga gee cgc egg gcg gee acg tgg 
gly trp arg leu thr) OPA^trp thr thr pro arg ser gly ala arg arg ala ala thr trp 

- gtg-tea gge-tga^cc-ag^ct^^^ 

lal ser glyW 6la ser ser thr arg arg thr leu ser trp trp phe pro thr ala ala 

.seta ib %Ce3 



SEQ ID Hoi, , 5 t /s > -.8<?3 
FIGURE 49A 



167/185 



901/301 931/311 

egg gca ttc ccg tgc agg age gee gtc tgg tgt ttg aac ggt ttt acc ggt egg cat egg 
arq ala phe pro cys arg ser ala val trp cys leu asn gly phe thr gly arg his arg 
961/321 991/331 

cac ggg cgt tgc egg gtt egg gee teg ggt tgg cga teg tea aac agg tgg tgc tea acc 
his gly arg cys arg val arg ala ser gly trp arg ser ser asn arg trp cys ser thr 
1021/341 1051/351 

acg gcg gat tgc tgc gca teg aag aca ccg acc cag gcg gec age cce ctg gaa cgt cga 
thr ala asp cys cys ala ser lys thr pro thr gin ala ala ser pro leu glu arg arg 
1081/361 1111/371 

ttt acg tgc tgc tec ccg gee gtc gga tgc cga ttc cgc age ttc ccg gtg cga egg ctg 
phe thr cys cys ser pro ala val gly cys arg phe arg ser phe pro val arg arg leu 
1141/381 1171/391 

gcg etc gga gca egg aca teg aga act etc ggg gtt egg cga acg tta tct cag tgg aat 
ala leu gly ala arg thr ser arg thr leu gly val arg arg thr leu ser gin trp asn 
1201/401 1231/411 

etc agt cca cgc gcg caa cct agt tgt gca gtt act gtt gaa age cac acc cat gec agt 
leu ser pro arg ala gin pro ser cys ala val thr val glu ser his thr his ala ser 
1261/421 1291/431 

cca cgc atg gee aag ttg gee cga gta gtg ggc eta gta cag gaa gag caa cct age gac 
pro arg met ala lys leu ala arg val val gly leu val gin glu glu gin pro ser asp 
1321/441 1351/451 

atg acg aat. cac cca egg tat, teg, cca ccg ccg cag cag ccg gga acc cca ggt tat get 
met thr asn his pro arg tyr ser pro pro pro gin gin pro gly thr pro gly tyr ala 
1381/461 1411/471 

cag ggg cag cag caa acg tac age cag cag ttc gac tgg cgt tac cca ccg tec ccg cce 
qln qly gin gin gin thr tyr ser gin gin phe asp trp arg tyr pro pro ser pro pro 
£441/481 1471/491 

ccq cag cca acc cag tac cgt caa cec tac gag gcg ttg ggt ggt acc egg ccg ggt ctg 
pro gin pro thr gin tyr arg gin pro tyr glu ala leu gly gly thr arg pro gly leu 
1501/501 1531/511 

ata cct ggc gtg att ccg acc atg acg cec cct cct ggg atg gtt cgc caa cgc cct cgt 
ile pro gly val ile pro thr met thr pro pro pro gly met val arg gin arg pro arg 
1561/521 1591/531 

qca ggc atg ttg gee ate ggc gcg gtg acg ata gcg gtg gtg tec gee ggc ate ggc ggc 
ala qly met leu ala ile gly ala val thr ile ala val val ser ala gly ile gly gly 
1621/541 1651/551 

qcq qcc gca tec ctg gtc ggg ttc aac egg gca cce gec ggc cec age ggc ggc cca gtg 
ala ala ala ser leu val gly phe asn arg ala pro ala gly pro ser gly gly pro val 
1681/561 1711/571 

get gec age gcg gcg cca age ate cec gca gca aac atg ccg ccg ggg teg gtc gaa cag 
ala ala ser ala ala pro ser ile pro ala ala asn met pro pro gly ser val glu gin 
1741/581 1771/591 

qtg gcg gec aag gtg gtg cce agt gtc gtc atg ttg gaa acc gat ctg ggc cgc cag teg 
val 111 ala lys val val pro ser val val met leu glu thr asp leu gly arg gin ser 
1801/601 1831/611 

gad gag ggc tec ggc ate att ctg tct gee gag ggg ctg ate ttg acc aac aac cac gtg 
111 all gly ser gly ile ile leu ser ala glu gly leu ile leu thr asn asn his val 
1861/621 1891/631 

ate gcg gcg gec gee aag cct cce ctg ggc agt ccg ccg ccg aaa acg acg gta 
ile ala ala ala ala lys pro pro leu gly ser pro pro pro lys thr thr val) 



SEQ IDkos."St/S.a«3 continued 1) 
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!/l 31/11 

gj^vlo.^ cat eta cac cgc tea aca gcc ggg cca gac get gcc ggt egg tgc tgc cga gaa ggc ggt 

^/his leu his arg ser thr ala gly pro asp ala ala gly arg cys cys arg glu gly gly 
-»*>*e*»6 1/21 91/31 

gat ccg tgg cga gtt gtt cat gtc gcg gcg cac cac cgc cga cca acg ggt get tgc cat 
asp pro trp arg val val his val ala ala his his arg arg pro thr gly ala cys his 
121/41 151/51 

ccg tct gac caa egg tag ttc get get gat etc caa aag tct caa gcc cac cga age agt 
pro ser asp gin arg}AMB(phe ala ala asp leu gin lys ser gin ala his arg ser ser 
181/61 -5£fc ^C£<c> 211/71 

cat gaa caa get gcg ttg ggt get att gat cgt ggg tgg gat egg ggt ggc ggt cgc cgc 
his glu gin ala ala leu gly ala ile asp arg gly trp asp arg gly gly gly arg arg 
241/81 271/91 

ggt ggc egg ggg gat ggt cac ccg ggc egg get gag gcc ggt ggg ccg cct cac cga age 
gly gly arg gly asp gly his pro gly arg ala glu ala gly gly pro pro his arg ser 
301/101 331/111 

ggc cga gcg ggt ggc gcg aac cga cga cct gcg gcc cat ccc cgt ctt egg cag cga cga 
gly arg ala gly gly ala asn arg arg pro ala ala his pro arg leu arg gin arg arg 
361/121 391/131 

att ggc cag get gac aga ggc att caa ttt aat get gcg ggc get ggc cga gtc acg gga 
ile gly gin ala asp arg gly ile gin phe asn ala ala gly ala gly arg val thr gly 
421/141 451/151 

acg gca ggc aag get ggt tac cga cgc egg aca tga att gcg tac ccc get aac gtc get 
thr ala gly lys ala gly tyr arg arg arg thr\oPA[ile ala tyr pro ala asn val ala 
481/161 ' 511/171 SCS> »C> t^o,%fc>7 

gcg cac caa tgt cga act ctt gat ggc etc gat ggc ccc ggg ggc tec gcg get acc caa 
ala his gin cys arg thr leu asp gly leu asp gly pro gly gly ser ala ala thr gin 
541/181 571/191 

gca gga gat ggt cga cct gcg tgc cga tgt get ggc tea aat cga gga att gtc cac act 
ala gly asp gly arg pro ala cys arg cys ala gly ser asn arg gly ile val his thr 
601/201 631/211 

ggt agg cga ttt ggt gga cct gtc ccg agg cga cgc egg aga agt ggt gca cga gcc ggt 
gly arg arg phe gly gly pro val pro arg arg arg arg arg ser gly ala arg ala gly 
661/221 691/231 

cga cat ggc tga cgt cgt cga ccg cag cct gga gcg ggt cag gcg gcg gcg caa cga tat 
arg his glyloPA(arg arg arg pro gin pro gly ala gly gin ala ala ala gin arg tyr 
721/241 £>03> tf^>vU>. 'Stf,* 751/251 

cct ttt cga cgt cga ggt gat tgg gtg gca ggt tta tgg cga tac cgc tgg att gtc gcg 
pro phe arg arg arg gly asp trp val ala gly leu trp arg tyr arg trp ile val ala 
781/261 811/271 

gat ggc get taa cct gat gga caa cgc cgc gaa gtg gag ccc gcc ggg egg cca cgt ggg 
asp gly ala) OCH (pro asp gly gin arg arg glu val glu pro ala gly arg pro arg gly 
841/281 SOS> .T-> 871/291 

tgt cag get ga^cca get cga cgc gtc gca cgc tga get ggt ggt ttc cga ccg egg ccc 
cys gin ala glu pro ala arg arg val ala arg^OPA (ala gly gly phe arg pro arg pro 
901/301 931/311 It* *S'**?6> 

ggg cat tec cgt gca gga gcg ccg tct ggt gtt tga acg gtt tta ccg gtc ggc ate ggc 

. — g±y . . his ser arg-al^-gl-y ala-pro^ ser- gly- val)-OPA (thr „yal„leu^.rp„^l jgly lie gly 

961/321 991/331 5€jS) 

acg ggc gtt gcc ggg ttc ggg cct egg gtt ggc gat cgt caa aca ggt ggt get caa cca 
thr gly val ala gly phe gly pro arg val gly asp arg gin thr gly gly ala gin pro 
1021/341 1051/351 

egg egg att get gcg cat cga aga cac cga ccc agg egg cca gcc ccc tgg aac gtc gat 
arg arg ile ala ala his arg arg his arg pro arg arg pro ala pro trp asn val asp 



SEQ ID Ko2>.S6?V*S77 
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1081/361 

tta cgt get get ccc egg ccg teg gat gee 
leu arg ala ala pro arg pro ser asp ala 
1141/381 

cgc teg gag cac gga cat cga gaa etc teg 
arg ser glu his gly his arg glu leu ser 
1201/401 

tea gtc cac gcg cgc aac eta gtt gtg cag 
ser val his ala arg asn leu val val gin 
1261/421 

cac gca tgg cca agt tgg ccc gag tag tgg 
his ala trp pro ser trp pro glu)AMB trp 
1321/441 

tga cga ate acc cac ggt att cgc cac cgc 
(DPAfarg ile thr his gly ile arg his arg 
i4aS73 ^1^81/461 

agg ggc age age aaa cgt aca gee age agt 
arg gly ser ser lys arg thr ala ser ser 
1441/481 

cgc age caa ccc agt acc gtc aac cct acg 

•• arg ser gin pro ser thr val asn pro thr 

1501/501 

tac ctg gcg tga ttc cga cca tga cgc ccc 
tyr leu ala OPA phe arg pro OPA (arg pro 
1561/521 Se<s> ifcrtaffW 

cag gca tgt tgg cca teg gcg egg tga cga 
gin ala cys trp pro ser ala arg) OPA arg 
1621/541 

egg ccg cat ccc tgg teg ggt tea acc ggg 
arg pro his pro trp ser gly ser thr gly 
1681/561 

ctg cca gcg egg cgc caa gca tec ccg cag 
leu pro ala arg arg gin ala ser pro gin 
1741/581 

tgg egg cca agg tgg tgc cca gtg teg tea 
trp arg pro arg trp cys pro val ser ser 
1801/601 

agg agg get ccg gca tea ttc tgt ctg ccg 
arg arg ala pro ala ser phe cys leu pro 
1861/621 

teg egg egg ccg cca age etc ccc tgg gca 
(ser arg arg pro pro ser leu pro trp ala 

S£S U>Ko.?77 



1111/371 

gat tec gca get tec egg tgc gac ggc tgg 
asp ser ala ala ser arg cys asp gly trp 
1171/391 

ggg ttc ggc gaa cgt tat etc agt gga ate 
gly phe gly glu arg tyr leu ser gly ile 
1231/411 

tta ctg ttg aaa gec aca ccc atg cca gtc 
leu leu leu lys ala thr pro met pro val 
1291/431 

gee tag tac agg aag age aac eta gcg aca 
ala AMB (tyr arg lys ser asn leu ala thr) 
1351/451^ t DK0.87£. 

cgc age age egg gaa ccc cag gtt atg etc 
arg ser ser arg glu pro gin val met leu 
1411/471 

teg act ggc gtt acc cac cgt ccc cgc ccc 
ser thr gly val thr his arg pro a'rg pro 
1471/491 

agg cgt tgg gtg gta ccc ggc egg gtc tga 
arg arg trp val val pro gly. arg. vay. OPA 
1531/511 

etc ctg gga tgg ttc gee aac gec etc gtg ) 
leu leu gly trp phe ala asn ala leu val 
1591/531 

tag egg tgg tgt ccg ccg gca teg gcg gcg 
AMB^arg trp cys pro pro ala ser ala ala 
1651/551 *5^<2> >£> Ko ^£> 

cac ccg ccg gee cca gcg gcg gee cag tgg 
his pro pro ala pro ala ala ala gin trp 
1711/571 

caa aca tgc cgc egg ggt egg teg aac agg 
gin thr cys arg arg gly arg ser asn arg 
1771/591 

tgt tgg aaa ccg ate tgg gec gee agt egg 
cys trp lys pro ile trp ala ala ser arg 
1831/611 

agg ggc tga tct tga cca aca acc acg tga 
arg glv) OPA ser OPA^pro thr thr thr)oPA 
1891/631 £>K£-y76? 
gtc cgc cgc cga aaa cga egg ta 
val arg arg arg lys arg arg) 
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1/1 31/11 



- a, ate tac acc get caa cag ccg ggc cag acg ctg ccg gtc ggt get gec gag aag gcg gtg 

^ ,r>^-S7S tac a^ g^ g ^ ^ ^ ^ leu prQ val gly ala ala glu lys ala val 

se&.Difo.S'rt 61/21 91/31 

ate cat ggc gag ttg ttc atg teg egg cgc acc acc gec gac caa egg gtg ctt gee ate 
lie arg gly glu leu phe met ser arg arg thr thr ala asp gin arg val leu ala He 



i?i/41 151/51 
cat ctg acc aac ggt agt teg ctg ctg ate tec aaa agt etc aag ccc acc gaa gca gtc 
1% leu \lr asn gly ser ser leu leu ile ser^lys ser leu lys pro thr glu ala val 

iti^ac aac ctg cgt tgg gtg eta ttg ate gtg ggt ggg ate ggg gtg gcg gtc gee gcg 
III asn lyl 1*1 arg trp val leu leu ile vainly gly ile gly val ala val ala ala 

1*1' lie ggg ggg atg gtc acc egg gec ggg ctg agg ccg gtg ggc cgc etc acc gaa gcg 
val ala gly gly me? val thr arg ala gly lev^arg pro val gly arg leu thr glu ala 

acc'aaq egg gtg gcg ega acc gac gac ctg egg ccc ate ccc gtc ttc ggc age gac gaa 
ala glu arg val 111 arg thr asp asp leu arg^pro ile pro val phe gly ser asp glu 

tta'acc agg ctg aca gag gca ttc aat tta atg ctg egg gcg ctg gec gag tea egg gaa 
ill Tit arg leu thr glu ala phe asn leu met^eu arg ala leu ala glu ser arg glu 

caa 7 caa gca agg ctg gtt acc gac gee gga eat gaa ttg cgt acc ccg eta acg teg ctg 
are gin 11a arg leu val thr asp ala gly his glu leu arg thr pro leu thr set leu 
dJ -y ^ 511/171 

cac'acc aat gtc gaa etc ttg atg gee teg atg gec ccg ggg get ccg egg eta ccc aag 
arg thr "n val glu leu leu met ala ser met^ala pro gly ala pro arg leu pro lys 

541/181 . _ «f r-^rr net caa ate aaq gaa ttg tec aca ctg 

leu 



caa aaa atg gtc gac ctg cgt gec gat gtg ctg get caa ate gag gaa ttg tec aca 
gin glu 2 val Lp leu arg ala asp val leu ala gin ile glu glu leu ser thr 
*m /?m 631/211 

' aat tt g gtg gac ctg tec ega ggc gac gec gga gaa gtg gtg cae gag ccg gtc 
val gly lsp ill val Lp leu ser arg gly asp ala gly glu val val hxs glu pro val 
/oil 691/231 

£2 E E £ S E £ E 2 5! 2 S E E 3 E E E - 
2 2 5 2 E 2 £ E S £ EE E 2 2 E E - 2! E 
2H £ £ S 2 £ £ E S E E £ £ £ £ £ E - - 
£ £ £ £ £ £ £ £ E £ EE £ » £ £ £ £ £ £ 
if £ £ E1ETE £ £ £ £ EE £ - = » K 

£ E £ £ £ £ £ £ £ a £ £ 2 £ £ £ £ E E E 
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m?i/341 1051/351 

qqc gga ttg ctg cgc ate gaa gac acc gac cca ggc ggc cag ccc cct gga acg teg att 
gly gly leu leu arg lie glu asp thr asp P£ j^* 1 * gln pr ° pr ° gly ^ 11 
lllKlV'ct.a etc ccc ggc cgt egg atg ccg att ccg cag ctt ccc ggt gcg acg get ggc 
tyr III ttu tt» pro gly arg arg me? pro i^PJJ^" ^ U Pr ° ^ ^ ^ ^ ^ 

irtcgg^gcacg gac ate gag aae tet egg ggt teg gcg aac gtt ate tea gtg gaa tct 
111 ara ser thr asp ile glu asn ser arg gly ser ala asn val lie ser val glu ser 
i9fti/4ni 1231/411 

all tec acg cgc gca acc tag ttg tgc agt tac tgt tga aag cca cac cca tgc cag tec 
all ser thr arg ala thr^AMB (leu cys ser tyr cy S )oPA(iys pro his pro cys gin ser 
1261/421 VC> Ko. 880 1291/431 ^CQ «t> He 8S' 



aca cat ggc caa gtt ggc ccg agt agt ggg cct agt aca gga aga gca acc tag cga eat 
thr hit gly gin val gly pro ser ser gly pro ser thr gly arg ala thr)AMBfarg his 
1^1/441 1351/451 S3fc*D«a382 

lit nil tea ccc acg gta ttc gee acc gee gca gca gec ggg aac ccc agg tta tgc tea 
asp glu ser pro thr val phe ala thr ala ala' ala ala gly asn pro arg leu cys ser 
"F r 1411/471 

qq^ac^gca gca aac gta cag cca gca gtt cga ctg gcg tta ccc ace gtc ccc gee ccc 
III 111, Til ala asn val gin pro ala val arg leu ala leu pro thr val pro ala pro 

1471/491 

"^^aac cca gta ccg tea acc eta cga ggc gtt ggg tgg tac ccg gee ggg tct gat 
111 Tit Tsn pro val pro ser thr leu arg gly J*^* tr P tvr P ro ala gly ser 3SP 
ac^ta^cgt gat tee gac cat gac gec ccc tec tgg gat ggt teg cca acg ccc teg tgc 
thr trp arg Lp ser asp his asp ala pro JJJ ^"P gly SGE pr ° thr Pt ° " r CyS 
HtKlV'aZt: aac cat egg cgc ggt gac gat age ggt ggt gtc cgc egg cat egg egg cgc 
arg his 111 HI his arg arg lly asp asp "^ly^Y val arg arg his arg arg arg 

illKll 1 ate cct ggt egg gtt caa ccg ggc acc cgc egg ccc cag egg egg ccc agt ggc 
gly arg 111 pro gly arg val gin pro gly ^arg^rg P" gin arg arg pro ser gly 

tqc'cac'cgc ggc gee aag eat ccc cgc age al^eat gee gec ggg gtc ggt cga aca ggt 
cys" HI arg Hy ala lyl his pro arg ser ly^J^ * la gly Val ^ ^ thr « ly 

1 nlt U aaV' C aa qgt ggt gee cag tgt cgt cat gtt gga aac cga tct ggg ccg cca gtc gga 
lly gly Tin gly Hy ala gin cys arg his val jl^asn arg ser gly pro pro val gly 
1801/601 ctt caa caa cca cgt gaC 

gfy g?y leu arg £S his ser val cys arg ITy ala Lp leu asp gin gin pro arg asp 

1891/631 

cqfggfggc cgc caa gec tec cct ggg cag tec gec gec gaa aac gac ggt a 
■-Wgly g?y arg gin ala ser- pro gly -gin- ser_ala_.ala_.glu_asn. as P _qly)_ 
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Coding sequence Rv0983 predicted by Cole et al., 1998 (Nature 393:537-544) and 
containing seq60A: 

1/1 

* iY\ m z 53?«s atg gcc aag ttg gcc cga gta gtg ggc eta gta cag gaa gag caa cct age gac atg acg 
^ **** (Met ala lys leu ala arg val val gly leu val gin glu glu gin pro ser asp met thr 

2^lDKaSS4 61/21 91/31 

aat cac cca egg tat teg cca ccg ccg cag cag ccg gga acc cca ggt tat get cag ggg 
asn his pro arg tyr ser pro pro pro gin gin pro gly thr pro gly tyr ala gin gly 
121/41 151/51 

cag cag caa acg tac age cag cag ttc gac tgg cgt tac cca ccg tec ccg ccc ccg cag 
oln qln gin thr tyr ser gin gin phe asp trp arg tyr pro pro ser pro pro pro gin 
181/61 211/71 

cca acc cag tac cgt caa ccc tac gag gcg ttg ggt ggt acc egg ccg ggt ctg ata cct 
pro thr gin tyr arg gin pro tyr glu ala leu gly gly thr arg pro gly leu ile pro 
241/81 271/91 

ggc gtg att ccg acc atg acg ccc cct cct ggg atg gtt cgc caa cgc cct cgt gca ggc 
qlv val ile pro thr met thr pro pro pro gly met val arg gin arg pro arg ala gly 
301/101 331/111 

atg ttg gcc ate ggc gcg gtg acg ata gcg gtg gtg tec gcc ggc ate ggc ggc gcg gcc 
met leu ala ile gly ala val thr ile ala val val ser ala gly ile gly gly ala ala 
361/121 391/131 

gca tec ctg gtc ggg ttc aac egg ,gca ccc gcc ggc ccc age ggc ggc cca gtg get gcc 
ala ser leu val gly phe asn arg ala pro ala gly pro ser gly gly pro val ala ala 
421/141 451/151 

age gcg gcg cca age ate ccc gca gca aac atg ccg ccg ggg teg gtc gaa cag gtg gcg 
ser ala ala pro ser ile pro ala ala asn met pro pro gly ser val glu gin val ala 
481/161 511/171 

gcc aag gtg gtg ccc agt gtc gtc atg ttg gaa acc gat ctg ggc cgc cag teg gag gag 
ala lys val val pro ser val val met leu glu thr asp leu gly arg gin ser glu glu 
541/181 571/191 

ggc tec ggc ate att ctg tct gcc gag ggg ctg ate ttg acc aac aac cac gtg ate gcg 
gly ser gly ile ile leu ser ala glu gly leu ile leu thr asn asn his val lie ala 
601/201 631/211 

acq gcc gcc aag cct ccc ctg ggc agt ccg ccg ccg aaa acg acg gta acc ttc tct gac 
ala ala ala lys pro pro leu gly ser pro pro pro lys thr thr val thr phe ser asp 
661/221 691/231 

qqq egg acc gca ccc ttc acg gtg gtg ggg get gac ccc acc agt gat ate gcc gtc gtc 
gly arg thr ala pro phe thr val val gly ala asp pro thr ser asp ile ala val val 
721/241 751/251 

cgt gtt cag ggc gtc tec ggg etc acc ccg ate tec ctg ggt tec tec teg gac ctg agg 
ara val gin gly val ser gly leu thr pro ile ser leu gly ser ser ser asp leu arg 

261 811/271 
gtc ggt cag ccg gtg ctg gcg ate ggg teg ccg etc ggt ttg gag ggc acc gtg acc acg 
val gly gin pro val leu ala ile gly ser pro leu gly leu glu gly thr val thr thr 



SEQ ID *2"V 
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841/281 

ggg ate gtc age get etc aac 
gly ile val ser ala leu asn 
901/301 

acc gtg ctg gac gec att cag 
thr val leu asp ala ile gin 
961/321 

ctg gtg aac atg aac get caa 
leu val asn met asn ala gin 
1021/341 

gac tea gee gat gcg cag age 
asp ser ala asp ala gin ser 
1081/361 

gec aag cgc ate gec gac gag 
ala lys arg ile ala asp glu 
1141/381 

gtg cag gtg acc aat gac aaa 
val gin val thr asn asp lys 
1201/401 

ggt get gec gcg aac get gga 
gly ala ala ala asn ala gly 
1261/421 

ccg ate aac age gcg gac gcg 
pro ile' asn ser ala asp ala 
1321/441 

gtg gcg eta acc ttt cag gat 
val ala leu thr phe gin asp 
1381/461 

aag gcg gag cag tga 
lys ala glu glnJOPA 



871/291 
cgt cca gtg teg acg acc 
arg pro val ser thr thr 

931/311 
acc gac gec gcg ate aac 
thr asp ala ala ile asn 

991/331 
etc gtc gga gtc aac teg 
leu val gly val asn ser 

1051/351 
ggc teg ate ggt etc ggt 
gly ser ile gly leu gly 

1111/371 
ttg ate age acc ggc aag 
leu ile ser thr gly lys 

1171/391 
gac acc ctg ggc gee aag 
asp thr leu gly ala lys 

1231/411 
gtg ccg aag ggc. gtc gtt 
val pro lys gly val val 

1291/431 
ttg gtt gee gee gtg egg 
leu val ala ala, val arg 

1351/451 
ccc teg ggc ggt age cgc 
pro ser gly gly ser arg 



ggc gag gec ggc aac cag aac 
gly glu ala gly asn gin asn 

ccc ggt aac tec ggg ggc gcg 
pro gly asn ser gly gly ala 

gee att gee acg ctg ggc gcg 
ala ile ala thr leu gly ala 

ttt gcg att cca gtc gac cag 
phe ala ile pro val asp gin 

gcg tea cat gec tec ctg ggt 
ala ser his ala ser leu gly 

ate gtc gaa gta gtg gee ggt 
ile val glu val val ala gly 

gtc acc aag gtc gac gac cgc 
val thr lys val asp asp arg 

tec aaa gcg ccg ggc gee acg 
ser lys ala pro gly ala thr 

aca gtg caa gtc acc etc ggc 
thr val gin val thr leu gly 



SEQ ID Kc&'3S3'*8S c /( continued 1) 
FIGURE 4 9D (continued 1) 



174/185 



ORF according to Cole et al., 1998 (Nature 393:537-544) and containing Rv0983 

1/1 31/11 
^ib^tga gcc age teg acg cgt cgc acg ctg age tgg tgg ttt ccg acc gcg gec egg gca ttc 
OPAYala ser ser thr arg arg thr leu ser trp trp phe pro thr ala ala arg ala phe 
S£*2 ibiAo.^-gi/^l 91/31 

ccg tgc agg age gee gtc tgg tgt ttg aac ggt ttt acc ggt egg cat egg cac ggg cgt 
pro cys arg ser ala val trp cys leu asn gly phe thr gly arg his arg his gly arg 
121/41 151/51 

tgc egg gtt egg gee teg ggt tgg cga teg tea aac agg tgg tgc tea acc acg gcg gat 
cvs arg val arg ala ser gly trp arg ser ser asn arg trp cys ser thr thr ala asp 
181/61 211/71 

tgc tgc gca teg aag aca ccg acc cag gcg gee age ccc ctg gaa cgt cga ttt acg tgc 
cvs cys ala ser lys thr pro thr gin ala ala ser pro leu glu arg arg phe thr cys 
241/81 271/91 

tgc tec ccg gee gtc gga tgc cga ttc cgc age ttc ccg gtg cga egg ctg gcg etc gga 
cys ser pro ala val gly cys arg phe arg ser phe pro val arg arg leu ala leu gly 
301/101 331/111 

gca egg aca teg aga act etc ggg gtt egg cga acg tta tct cag tgg aat etc agt cca 
ala arg thr ser arg thr leu gly val arg arg thr leu ser gin trp asn leu ser pro 
361/121 391/131 

cgc gcg caa cct agt tgt gca gtt act gtt gaa age cac acc cat gee agt cca cgc atg 
arq ala gin pro ser cys ala val thr val glu ser his thr his ala ser pro arg met 
421/141 451/151 

gee aag ttg gee cga gta gtg ggc eta gta cag gaa gag caa cct agc.gac atg acg. aat 
ala lys leu ala arg val val gly leu val gin glu glu gin pro ser asp met thr asn 
481/161 511/171 

cac cca egg tat teg cca ccg ccg cag cag ccg gga acc cca ggt tat get cag ggg cag 
his pro arg tyr ser pro pro pro gin gin pro gly thr pro gly tyr ala gin gly gin 
541/181 571/191 

cag caa acg tac age cag cag ttc gac tgg cgt tac cca ccg tec ccg ccc ccg cag cca 
gin gin thr tyr ser gin gin phe asp trp arg tyr pro pro ser pro pro pro gin pro 
601/201 631/211 

acc cag tac cgt caa ccc tac gag gcg ttg ggt ggt acc egg ccg ggt ctg ata cct ggc 
thr qln tyr arg gin pro tyr glu ala leu gly gly thr arg pro gly leu ile pro gly 
661/221 691/231 

gtg att ccg acc atg acg ccc cct cct ggg atg gtt cgc caa cgc cct cgt gca ggc atg 
val ile pro thr met thr pro pro pro gly met val arg gin arg pro arg ala gly met 
721/241 751/251 

ttg gee ate ggc gcg gtg acg ata gcg gtg gtg tec gee ggc ate ggc ggc gcg gee gca 
leu ala ile gly ala val thr ile ala val val ser ala gly ile gly gly ala ala ala 
781/261 811/271 

tec ctg gtc ggg ttc aac egg gca ccc gee ggc ccc age ggc ggc cca gtg get gee age 
ser leu val gly phe asn arg ala pro ala gly pro ser gly gly pro val ala ala ser 
841/281 871/291 

gcg gcg cca age ate ccc gca gca aac atg ccg ccg ggg teg gtc gaa cag gtg gcg gee 
ala ala pro ser ile pro ala ala asn met pro pro gly ser val glu gin val ala ala 
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901/301 931/311 

aag gtg gtg ccc agt gtc gtc atg ttg gaa acc gat ctg ggc cgc cag teg gag gag ggc 
lys val val pro ser val val met leu glu thr asp leu gly arg gin ser glu glu gly 
961/321 991/331 

tec ggc ate att ctg tct gcc gag ggg ctg ate ttg acc aac aac cac gtg ate gcg gcg 
ser gly ile ile leu ser ala glu gly leu ile leu thr asn asn his val ile ala ala 
1021/341 1051/351 

gcc gcc aag cct ccc ctg ggc agt ccg ccg ccg aaa acg acg gta acc ttc tct gac ggg 
ala ala lys pro pro leu gly ser pro pro pro lys thr thr val thr phe ser asp gly 
1081/361 1111/371 

egg acc gca ccc ttc acg gtg gtg ggg get gac ccc acc agt gat ate gcc gtc gtc cgt 
are thr ala pro phe thr val val gly ala asp pro thr ser asp ile ala val val arg 
1141/381 1171/391 

gtt cag ggc gtc tec ggg etc acc ccg ate tec ctg ggt tec tec teg gac ctg agg gtc 
val gin gly val ser gly leu thr pro ile ser leu gly ser ser ser asp leu arg val 
1201/401 1231/411 

ggt cag ccg gtg ctg gcg ate ggg teg ccg etc ggt ttg gag ggc acc gtg acc acg ggg 
qlv gin pro val leu ala ile gly ser pro leu gly leu glu gly thr val thr thr gly 
1261/421 1291/431 

ate gtc age get etc aac cgt cca gtg teg acg acc ggc gag gcc ggc aac cag aac acc 
ile val ser ala leu asn arg pro val ser thr thr gly glu ala gly asn gin asn thr 
1321/441 1351/451 

■gtg "ctg' gac gcc att cag acc gac gcc gcg ate. aac ccc ggt aac tec ggg . ggc gcg, ctg 
val leu asp ala ile gin thr asp ala ala ile asn pro gly asn ser gly gly ala leu 
1381/461 1411/471 

gtg aac atg aac get caa etc gtc gga gtc aac teg gcc att gcc acg ctg ggc gcg gac 
val asn met asn ala gin leu val gly val asn ser ala ile ala thr leu gly ala asp 
1441/481 1471/491 

tea gcc gat gcg cag age ggc teg ate ggt etc ggt ttt gcg att cca gtc gac cag gcc 
ser ala asp ala gin ser gly ser ile gly leu gly phe ala ile pro val asp gin ala 
1501/501 1531/511 

aag cgc ate gcc gac gag ttg ate age acc ggc aag gcg tea cat gcc tec ctg ggt gtg 
lys arg ile ala asp glu leu ile ser thr gly lys ala ser his ala ser leu gly val 
1561/521 1591/531 

cag gtg acc aat gac aaa gac acc ctg ggc gcc aag ate gtc gaa gta gtg gcc ggt ggt 
gin val thr asn asp lys asp thr leu gly ala lys ile val glu val val ala gly gly 
1621/541 1651/551 

get gcc gcg aac get gga gtg ccg aag ggc gtc gtt gtc acc aag gtc gac gac cgc ccg 
ala ala ala asn ala gly val pro lys gly val val val thr lys val asp asp arg pro 
1681/561 1711/571 

ate aac age gcg gac gcg ttg gtt gcc gcc gtg egg tec aaa gcg ccg ggc gcc acg gtg 
ile asn ser ala asp ala leu val ala ala val arg ser lys ala pro gly ala thr val 
1741/581 1771/591 

gcg eta acc ttt cag gat ccc teg ggc ggt age cgc aca gtg caa gtc acc etc ggc aag 
ala leu thr phe gin asp pro ser gly gly ser arg thr val gin val thr leu gly lys 
1.8.0,1 /_6 01 _ ... . ____ _ . .. 



gcg gag cag tga 
ala glu gln)oPA 
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Fragment ainplified by PCR based on the sequence similarities with a serine protease of 
the E.coli HtrA family (creation of an SnaBI site at the 3' end) and subcloned into the 
vector pJVEDa: 

1/1 31/11 
iiMAo Ifcl gat ccg gcg ggg egg gtg teg gcg cag gcg tgg ctg gcg gtc acg gcg gtg egg gcg gtg 
fasp pro ala gly arg val ser ala gin ala trp leu ala val thr ala val arg ala val 
iSz® *D *o 3® \>1/21 91/31 

ccg ccg ggc tgt ggg gcg ccg gcg gcg gcg gtg gca atg gcg gga acg gcg ccg atg cca 
pro pro gly cys gly ala pro ala ala ala val ala met ala gly thr ala pro met pro 
121/41 151/51 

aca teg tea gcg gtg gag acg gtg gee teg gcg gtg ccg gtg gcg gtg gcg gat ggc tct 
thr ser ser ala val glu thr val ala ser ala val pro val ala val ala asp gly ,ser 
181/61 211/71 

acg gcg acg gcg ggg ccg gcg gac acg gcg gac aag gcg caa teg gec teg gcg gcg gcg 
thr ala thr ala gly pro ala asp thr ala asp lys ala gin ser ala ser ala ala ala 
241/81 271/91 

ccg gcg gcg acg ggg gec agg gcg gcg ccg gec gcg gac tgt ggg gta ctg gcg gcg ccg 
pro ala ala thr gly ala arg ala ala pro ala ala asp cys gly val leu ala ala pro 
301/101 331/111 

gcg gac acg gcg ggc aag gcg gtg gta ccg ggg gec cac cgc tgc ccg gtc agg cag gca 
ala asp thr ala gly lys ala val val pro gly ala his arg cys pro val arg gin ala 
361/121 391/131 

tgg gcg ccg egg gtg gcg ccg gtg ggc tga teg gca acg gcg ggg ccg gcg gcg acg gcg 
trp ala pro arg val ala pro val gly)oPA/ser ala ( thr ala gly pro ala ala thr ala 
421/141 451/151 3^0>KO. ^9 

gtg teg gcg cgt ccg gcg ggg teg ccg gaa tag gcg gtg ccg gcg gga acg cca tgc tga 
val ser ala arg pro ala gly ser pro glu)AMB/ala val pro ala gly thr pro cys) OPA 
481/161 511/171 ses |^>K0- w 

* 'teg" ggc acg gcg gcg ccg gcg gcg ccg 1 gcg gag aca gca gtt teg eta atg gcg egg ccg 
fser alv thr ala ala pro ala ala pro ala glu thr ala val ser leu met ala arg pro 
£»fc>*oWI W/ISI 571/191 

gcg gcg egg gcg gtg ccg gag ggc acc tct teg gca atg gcg ggt ccg gcg gee acg gcg 
ala ala arg ala val pro glu gly thr ser ser ala met ala gly pro ala ala thr ala 
601/201 631/211 

gag ccg tea egg ccg gca aca ccg gta teg gtg gcg ccg gcg gcg teg gtg ggg acg cca 
glu pro ser arg pro ala thr pro val ser val ala pro ala ala ser val gly thr pro 
661/221 691/231 

ggc tga teg gec acg gtg gcg ccg gcg gtg ccg gcg ggg acc gcg ccg gag cct tgg ttg 
gly) OPA (ser ala thr val ala pro ala val pro ala gly thr ala pro glu pro trp leu 
721/241 iOKc. 751/251 

gee gtg acg gcg ggc ccg gtg gga acg ggg gcg ctg gcg gee age tat acg gca acg gcg 
ala val thr ala gly pro val gly thr gly ala leu ala ala ser tyr thr ala thr ala 
781/261 811/271 

gcg acg gcg ccc ccg gca ccg gcg gaa cac tgc agg egg egg tga gcg gat tgg tga egg 
ala thr ala pro pro ala pro ala glu his cys arg arg arg)OPA ala asp trp OPAfarg 
841/281 871/291 <3aa> rte. S<?3 

ctt tgt teg gtg cac ccg gee aac ccg gcg aca ccg gee aac ccg get age ccc gat caa 
leu cys ser val his pro ala asn pro ala thr pro ala asn pro ala ser pro asp gin 
901/301 931/311 

cga ggg ttt egg tgc egg tec ggg gca tgg cca tec get gag ctg gcg ate tgg act acg 
arg gly phe arg cys arg ser gly ala trp pro ser ala glu leu ala ile trp thr thr 
961/321 991/331 

ttg gtg tag aaa aat cct gee gec egg acc ctt aag get ggg aca att tct gat age tac 
leu val)AMB(lys asn pro ala ala arg thr leu lys ala gly thr ile ser asp ser tyr 
~ ^ 1021734 1"S£& lfc>Ko.€<# ■ 7 "1051-A351 * — - — — 



ccc gac aca gga ggt tac ggg atg age aat teg cgc cgc cgc tea etc agg tgg tea tgg 

pro asp thr gly gly tyr gly met ser asn ser arg arg arg ser leu arg trp ser trp 

1081/361 1111/371 

ttg ctg age gtg ctg get gec gtc ggg ctg ggc ctg gee acg gcg ccg gec cag gcg gee 

leu leu ser val leu ala ala val gly leu gly leu ala thr ala pro ala gin ala ala 
1141/381 

ccg ccg gee ttg teg cag gac egg tt 

pro pro ala leu ser gin asp arg) 



SEQ ID /t<75.^7-S9V 
FIGURE 50A 



177/185 



1/1 31/11 

^«bk(o^s atc cgg cgg ggc ggg tgt cgg cgc agg cgt ggc tgg cgg tca ° gg C " tgC g " ° 9g tgC 
< - t vi.le arg arg gly gly cys arg arg arg arg gly trp arg ser arg arg cys gly arg cys 
<3eo> fb Ko *%> 61/21 91/31 

cgc cgg get gtg ggg cgc cgg cgg cgg cgg tgg caa tgg cgg gaa cgg cgc cga tgc caa 
arg arg ala val gly arg arg arg arg arg trp gin trp arg glu arg arg arg cys gin 
121/41 151/51 

cat cgt cag cgg tgg aga cgg tgg cct cgg cgg tgc cgg tgg cgg tgg cgg atg get eta 
his arg gin arg trp arg arg trp pro arg arg cys arg trp arg trp arg met ala leu 
181/61 211/71 

cgg cga cgg cgg ggc cgg cgg aca cgg cgg aca agg cgc aat cgg cct cgg cgg cgg cgc 
arg arg arg arg gly arg arg thr arg arg thr arg arg asn arg pro arg arg arg arg 
241/81 271/91 

cgg cgg cga cgg ggg cca ggg cgg cgc cgg ccg cgg act gtg ggg tac tgg cgg cgc cgg 
arg arg arg arg gly pro gly arg arg arg pro arg thr val gly tyr trp arg arg arg 
301/101 331/111 

cgg aca cgg cgg gca agg cgg tgg tac cgg ggg ccc acc get gee cgg tca ggc agg cat 
arg thr arg arg ala arg arg trp tyr arg gly pro thr ala ala arg ser gly arg his 
361/121 391/131 

ggg cgc cgc ggg tgg cgc cgg tgg get gat cgg caa cgg cgg ggc cgg cgg cga cgg cgg 
gly arg arg gly trp arg arg trp ala asp arg gin arg arg gly arg arg arg arg arg 
421/141 451/151 

tgt cgg cgc gtc cgg cgg ggt cgc cgg agt agg cgg tgc cgg cgg gaa cgc cat get gat 
cys arg arg val arg arg gly arg arg ser arg arg cys arg arg glu arg his ala asp - 
481/161 511/171 

cgg gca cgg cgg cgc cgg cgg cgc cgg cgg aga cag cag ttt cgc taa tgg cgc ggc cgg 
arg ala arg arg arg arg arg arg arg arg arg gin gin phe argJOCH^trp arg gly arg 
541/181 571/191 3oS>'/t)kb.^97 ' 

cgg cgc ggg cgg tgc cgg agg gca cct ctt cgg caa tgg cgg gtc cgg cgg cca cgg; cgg 
arg arg gly arg cys arg arg ala pro leu arg gin trp arg val arg arg pro arg arg 
601/201 631/211 

age cgt cac ggc cgg caa cac cgg tat cgg tgg cgc cgg cgg cgt cgg tgg gga cgc cag 
ser arg his gly arg gin his arg tyr arg trp arg arg arg arg arg trp gly arg gin 
661/221 691/231 

get gat cgg cca cgg tgg cgc cgg cgg tgc cgg cgg gga ccg cgc cgg age ctt ggt tgg 
ala asp arg pro arg trp arg arg arg cys arg arg gly pro arg arg ser leu gly trp 
721/241 751/251 

ccg tga cgg cgg gec cgg tgg gaa cgg ggg cgc tgg cgg cca get ata cgg caa cgg cgg 
pro\oPA(arg arg ala arg trp glu arg gly arg trp arg pro ala ile arg gin arg arg 
781/261 SCSI «5 KC- ?W 811/271 

cga cgg cgc ccc cgg cac cgg cgg aac act gca ggc ggc ggt gag cgg att ggt gac ggc 
arg arg arg pro arg his arg arg asn thr ala gly gly gly glu arg ile gly asp gly 
841/281 871/291 

ttt gtt cgg tgc acc cgg cca acc cgg cga cac cgg cca acc cgg eta gec ccg ate aac 
phe val arg cys thr arg pro thr arg arg his arg pro thr arg leu ala pro ile asn 
901/301 931/311 

gag ggt ttc ggt gec ggt ccg ggg cat ggc cat ccg ctg age tgg cga tct gga eta cgt 
glu gly phe gly ala gly pro gly his gly his pro leu ser trp arg ser gly leu arg 
961/321 991/331 

tgg tgt aga aaa ate ctg ccg ccc gga ccc tta agg ctg gga caa ttt ctg ata get acc 
trp cys arg lys ile leu pro pro gly pro leu arg leu gly gin phe leu xle ala thr 
1021/341 1051/351 

ccg aca cag gag gtt acg gga tga aca att cgc gee gec get cac tca ggt ggt cat ggt 

J pro thr' gin glTT val thr glylOPA^la iafc-arg-al-a~ala-al^ 

1081/361 Sc&iOKoOT? Hll/371 

tgc tga gcg tgc tgg ctg ccg teg ggc tg<J gec tgg cca cgg cgc cgg ccc agg cgg ccc 

cys)0PA(ala cys trp leu pro ser gly trp ala trp pro arg arg arg pro arg arg pro 

1141/381-2£& xV^^a-^CO 

cgc cgg cct tgt cgc agg acc ggt t 

arg arg pro cys arg arg thr gly) 



SEQ ID flc&. 3?£>-#X> 
FIGURE SOB 



178/185 



\ U1 31/11 

-Scto ,r>i^o Qot tec ggc ggg gcg ggt gtc ggc gca ggc gtg get ggc ggt cac ggc ggt gcg ggc ggt gcc 
^^ Qo ^ ser gly gly ala gly val gly ala gly val ala gly gly his gly gly ala gly gly ala 
61/21 91/31 

gcc ggg ctg tgg ggc gcc ggc ggc ggc ggt ggc aat ggc ggg aac ggc gcc gat gcc aac 
ala gly leu trp gly ala gly gly gly gly gly asn gly gly asn gly ala asp ala asn 
121/41 151/51 

ate gtc age ggt gga gac ggt ggc etc ggc ggt gcc ggt ggc ggt ggc gga tgg etc tac 
ile val ser gly gly asp gly gly leu gly gly ala gly gly gly gly gly trp leu tyr 
• 181/61 211/71 

ggc gac ggc ggg gcc ggc gga cac ggc gga caa ggc gca ate ggc etc ggc ggc ggc gcc 
gly asp gly gly ala gly gly his gly gly gin gly ala ile gly leu gly gly gly ala 
241/81 271/91 

ggc ggc gac ggg ggc cag ggc ggc gcc ggc cgc gga ctg tgg ggt act ggc ggc gcc ggc 
gly gly asp gly gly gin gly gly ala gly arg gly leu trp gly thr gly gly ala gly 
301/101 331/111 

gga cac ggc ggg caa ggc ggt ggt ace ggg ggc cca ccg ctg ccc ggt cag gca ggc atg 
glv his gly gly gin gly gly gly thr gly gly pro pro leu pro gly gin ala gly met 
561/121 391/131 

ggc gcc gcg ggt ggc gcc ggt ggg ctg ate ggc aac ggc ggg gcc ggc ggc gac ggc ggt 
gly ala ala gly gly ala gly gly leu ile gly asn gly gly ala gly gly asp gly gly 
421/141 451/151 

gtc ggc gcg tec ggc ggg gtc gcc gga gta ggc ggt gcc ggc ggg aac gcc atg ctg ate 
val gly ala ser gly gly val ala gly val gly gly ala gly gly asn ala met leu ile • 
481/161 511/171 

ggg cac ggc ggc gcc ggc ggc gcc ggc gga gac age agt ttc get aat ggc gcg gcc ggc 
. alv his gly gly ala gly gly ala gly. gly asp ser, ser phe ala asn gly ala ala gly 
541/181 571/191 

ggc gcg ggc ggt gcc gga ggg cac etc ttc ggc aat ggc ggg tec ggc ggc cac ggc gga 
gly ala gly gly ala gly gly his leu phe gly asn gly gly ser gly gly his gly gly 
601/201 631/211 

gcc gtc acg gcc ggc aac acc ggt ate ggt ggc gcc ggc ggc gtc ggt ggg gac gcc agg 
ala val thr ala gly asn thr gly ile gly gly ala gly gly val gly gly asp ala arg 
661/221 691/231 

ctg ate ggc cac ggt ggc gcc ggc ggt gcc ggc ggg gac cgc gcc gga gcc ttg gtt ggc 
leu ile gly his gly gly ala gly gly ala gly gly asp arg ala gly ala leu val gly 
721/241 751/251 

cgt gac ggc ggg ccc ggt ggg aac ggg ggc get ggc ggc cag eta tac ggc aac ggc ggc 
arg asp gly gly pro gly gly asn gly gly ala gly gly gin leu tyr gly asn gly gly 
78i/261 811/271 

gac ggc gcc ccc ggc acc ggc gga aca ctg cag gcg gcg gtg age gga ttg gtg acg get 
asp gly ala pro gly thr gly gly thr leu gin ala ala val ser gly leu val thr ala 
841/281 871/291 

ttg ttc ggt gca ccc ggc caa ccc ggc gac acc ggc caa ccc ggc tag ccc cga tea acg 
leu phe gly ala pro gly gin pro gly asp thr gly gin pro gly\AMB(pro arg ser thr 
901/301 931/311 ' '3e$ i&Ko.Qe3 

agg gtt teg gtg ccg gtc egg ggc atg gcc ate cgc tga get ggc gat ctg gac tac gtt 
arg val ser val pro val arg gly met ala ile arg)oPA(ala gly asp leu asp tyr val 
961/321 991/331 , b n&<fotf 

ggt gta gaa aaa tec tgc cgc ccg gac cct taa ggc tgg gac aat ttc tga tag eta ccc 
alv val glu lys ser cys arg pro asp pro)oCH(gly trp asp asn pheJOPA AMBfleu pro 
1021/341 1051/351 -5£fli i^Uc> Qo^ ^ Q0{p 

I cga.cac agg agg tta egg gat gag caa ttc gcg ccg ccg etc act cag gtg gtcjitg gtt 

arg his arg arg leu arg asp glu gin phe ala pro pro leu thr gin val val met val 
1081/361 1111/371 

get gag cgt get ggc tgc cgt egg get ggg cct ggc cac ggc gcc ggc cca ggc ggc ccc 
ala glu arg ala gly cys arg arg ala gly pro gly his gly ala gly pro gly gly pro 
1141/381 

gcc ggc ctt gtc gca gga ccg gtt 
ala gly leu val ala gly pro val) 



SEQ ID Jto8.<?0/-<?46? 



FIGURE 50C 



179/185 



Coding sequence Rv0125 predicted by Cole et al., 1998 (Nature 393:537-544) and 
containing seq50A: 

1/1 31/11 
f^iQ>Ko 3©7 atg age aat teg cgc cgc cgc tea etc agg tgg tea tgg ttg ctg age gtg ctg get gcc 
^ <? iket ser asn ser arg arg arg ser leu arg trp ser trp leu leu ser val leu ala ala 

se£ir>Ko.«c*F i/2i 9l/31 

gtc ggg ctg ggc ctg gcc acg gcg ccg gcc cag gcg gcc ccg ccg gcc ttg teg cag gae 
val gly leu gly leu ala thr ala pro ala gin ala ala pro pro ala leu ser gin asp 
121/41 151/51 

egg ttc gcc gae ttc ccc gcg ctg ccc etc gae ccg tec gcg atg gtc gcc eaa gtg ggg 
arg phe ala asp phe pro ala leu pro leu asp pro ser ala met val ala gin val gly 
181/61 211/71 

cca cag gtg gtc aac ate aac acc aaa ctg ggc tac aac aac gcc gtg ggc gcc ggg acc 
pro gin val val asn ile asn thr lys leu gly tyr asn asn ala val gly ala gly thr 
241/81 271/91 

ggc ate gtc ate gat ccc aac ggt gtc gtg ctg acc aac aac cac gtg ate gcg ggc gcc 
gly ile val ile asp pro asn gly val val leu thr asn asn his val ile ala gly ala 
301/101 331/111 

acc gae ate aat gcg ttc age gtc ggc tec ggc eaa acc tac ggc gtc gat gtg gtc ggg 
thr asp ile asn ala phe ser val gly ser gly gin thr tyr gly val asp val val gly 
361/121 391/131 

tat gae cgc acc cag gat gtc gcg gtg ctg cag ctg cgc ggt gcc ggt ggc ctg ccg teg 
tyr asp arg thr gin asp val ala val leu gin leu arg gly ala gly gly leu pro ser 
421/141 451/151 

gcg gcg ate ggt ggc ggc gtc gcg gtt ggt gag ccc gtc gtc gcg atg ggc aac age ggt 
ala ala ile gly gly gly val ala val gly glu pro val val ala met gly ash ser gly 
481/161 511/171 

ggg cag ggc gga acg ccc cgt gcg gtg cct ggc agg gtg gtc gcg etc ggc caa acc gtg 
gly gin gly gly thr pro arg ala val pro gly arg val val ala leu gly gin thr val 
541/181 571/191 

cag gcg teg gat teg ctg acc ggt gcc gaa gag aca ttg aac ggg ttg ate cag ttc gat 
gin ala ser asp ser leu thr gly ala glu glu thr leu asn gly leu ile gin phe asp 
601/201 631/211 

gcc gcg ate cag ccc ggt gat teg ggc ggg ccc gtc gtc aac ggc eta gga cag gtg gtc 
ala ala ile gin pro gly asp ser gly gly pro val val asn gly leu gly gin val val 
661/221 691/231 

ggt atg aac acg gcc gcg tec gat aac ttc cag ctg tec cag ggt ggg cag gga ttc gcc 
gly met asn thr ala ala ser asp asn phe gin leu ser gin gly gly gin gly phe ala 
721/241 751/251 

att ccg ate ggg cag gcg atg gcg ate gcg ggc cag ate cga teg ggt ggg ggg tea ccc 
ile pro ile gly gin ala met ala ile ala gly gin ile arg ser gly gly gly ser pro 
781/261 811/271 

acc gtt cat ate ggg cct acc gcc ttc etc ggc ttg ggt gtt gtc gae aac aac ggc aac 
thr val his ile gly pro thr ala phe leu gly leu gly val val asp asn asn gly asn 
841/281 871/291 

ggc gca cga gtc caa cgc gtg gtc ggg age get ccg gcg gca agt etc ggc ate tec acc 
gly ala arg val gin arg val val gly ser ala pro ala ala ser leu gly ile ser thr 
901/301 931/311 

_ ggc- gac g t g-a t rc-a cc-geg^g tc^ga c-ggc^g c t-ccg-a t c^aa c _ t cg_ gcp_a^c_gcg„at_g_gc^g_gac 

gly asp val ile thr ala val asp gly ala pro ile asn ser ala thr ala met ala asp 
961/321 991/331 

gcg ctt aac ggg cat cat ccc ggt gae gtc ate teg gtg acc tgg caa acc aag teg ggc 
ala leu asn gly his his pro gly asp val ile ser val thr trp gin thr lys ser gly 
1021/341 1051/351 

ggc acg cgt aca ggg aac gtg aca ttg gcc gag gga ccc ccg gcc tga 
gly thr arg thr gly asn val thr leu ala glu gly pro pro ala) OPA 



SEQ ID Wcs.^>7-90B 



FIGURE SOD 



180/185 



ORF according to Cole et al., 1998 (Nature 393:537-544) and containing Rv0125: 

^ 1/1 31/11 
SCO «>Ko.909 ta g aaa aat cct gcc gcc egg acc ctt aag get ggg aca att tct gat age tac ccc gac 

- AMB Ays asn pro ala ala arg thr leu lys ala gly thr ile ser asp ser tyr pro asp 

t5£<&lt>Ao-q/0 61/ £ y 91/31 

aca gga ggt tac ggg atg age aat teg cgc cgc cgc tea etc agg tgg tea tgg ttg ctg 
thr gly gly tyr gly met ser asn ser arg arg arg ser leu arg trp ser trp leu leu 
121/41 151/51 

age gtg ctg get gcc gtc ggg ctg ggc ctg gcc acg gcg ccg gcc cag gcg gcc ccg ccg 
ser val leu ala ala val gly leu gly leu ala thr ala pro ala gin ala ala pro pro 
181/61 211/71 

gcc ttg teg cag gac egg ttc gcc gac ttc ccc gcg ctg ccc etc gac ccg tec gcg atg 
ala leu ser gin asp arg phe ala asp phe pro ala leu pro leu asp pro ser ala met 
241/81 271/91 

gtc gcc caa gtg ggg cca cag gtg gtc aac ate aac acc aaa ctg ggc tac aac aac gcc 
val ala gin val gly pro gin val val asn ile asn thr lys leu gly tyr asn asn ala 
301/101 331/111 

gtg ggc gcc ggg acc ggc ate gtc ate gat ccc aac ggt gtc gtg ctg acc aac aac cac 
val gly ala gly thr gly ile val ile asp pro asn gly val val leu thr asn asn his 
361/121 391/131 

gtg ate gcg ggc gcc acc gac ate aat gcg ttc age gtc ggc tec ggc caa acc tac ggc 
val ile ala gly ala thr asp ile asn ala phe ser val gly ser gly gin thr tyr gly 
421/141 451/151 

gtc gat gtg gtc ggg tat gac cgc acc cag gat gtc gcg gtg ctg cag ctg cgc ggt gcc . 
val asp val val gly tyr asp arg thr gin asp val ala val leu gin leu arg gly ala 
481/161 511/171 

ggt ggc ctg ccg teg gcg gcg ate ggt ggc ggc gtc gcg gtt ggt gag ccc gtc gtc gcg 
gly" gly leu pro ser ala ala ile gly gly gly val ala val gly glu pro val val ala 
541/181 571/191 

atg ggc aac age ggt ggg cag ggc gga acg ccc cgt gcg gtg cct ggc agg gtg gtc gcg 
met gly asn ser gly gly gin gly gly thr pro arg ala val pro gly arg val val ala 
601/201 631/211 

etc ggc caa acc gtg cag gcg teg gat teg ctg acc ggt gcc gaa gag aca ttg aac ggg 
leu gly gin thr val gin ala ser asp ser leu thr gly ala glu glu thr leu asn gly 
661/221 691/231 

ttg ate cag ttc gat gcc gcg ate cag ccc ggt gat teg ggc ggg ccc gtc gtc aac ggc 
leu ile gin phe asp ala ala ile gin pro gly asp ser gly gly pro val val asn gly 
721/241 751/251 

eta gga cag gtg gtc ggt atg aac acg gcc gcg tec gat aac ttc cag ctg tec cag ggt 
leu gly gin val val gly met asn thr ala ala ser asp asn phe gin leu ser gin gly 
781/261 811/271 

ggg cag gga ttc gcc att ccg ate ggg cag gcg atg gcg ate gcg ggc cag ate cga teg 
gly gin gly phe ala ile pro ile gly gin ala met ala ile ala gly gin ile arg ser 
841/281 871/291 

ggt ggg ggg tea ccc acc gtt cat ate ggg cct acc gcc ttc etc ggc ttg ggt gtt gtc 
gly gly gly ser pro thr val his ile gly pro thr ala phe leu gly leu gly val val 
901/301 931/311 

gac aac aac ggc aac ggc gca cga gtc caa cgc gtg gtc ggg age get ccg gcg gca agt 
asp asn asn gly asn gly ala arg val gin arg val val gly ser ala pro ala ala ser 
961/321 991/331 

etc ggc ate tec acc ggc gac gtg ate acc gcg gtc gac ggc get ccg ate aac teg gcc 
leu gly ile ser thr gly asp val ile thr ala val asp gly ala pro ile asn ser ala 
1021/341 1051/351 

^c~gcg atg gcg~gac"gcg ctt aac -ggg-cat-cat-cec- ggt gac gtc- ate -tcg^gtg^acc tgg 

thr ala met ala asp ala leu asn gly his his pro gly asp val ile ser val thr trp 
1081/361 1111/371 
•caa acc aag teg ggc ggc acg cgt aca ggg aac gtg aca ttg gcc gag gga ccc ccg gcc 
gin thr lys ser gly gly thr arg thr gly asn val thr leu ala glu gly pro pro ala) 
1141/381 
tga 
OPA 
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Experiment of molecular hybridization of a specific to DP428 on 
tne genomic DNA of various mycobacterial species 



12 3456789 10111213 



1,6kb 
1 kb 



/ 



1: M. tuberculosis 2: M. bovis 3: BCG 4: M. africanum 5: cancelled 6: M. 
fortuitum 7: M. simiae 8 : AT. avium 9: M. chelonae 10: M. flavescens 11: M. 
gordonae 12: M. marinum 13: M. kansasii 
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